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I. INTRODUCTION 

The study of decays of the B® meson (with a quark 
content of bs) and the B° meson (bd) into a dimuon pair 
(/j + /i~) has long been of great interest as a test of the 
standard model (SM) of particle physics. These flavor- 
changing neutral-current (FCNC) decays occur in the 
SM only through weak-interaction-mediated loop pro- 
cesses whose amplitudes are suppressed via the Glashow- 
Iliopoulos-Maiani mechanism [l| and by hclicity conser- 
vation. In the SM, the branching fractions for B® — > 
and B° fx + yT arc predicted to be (3.2 ± 0.3) x 
lfT 9 and (1.1 ± 0.1) x 10~ 10 , respectively 0). Note 
that the inclusion of charge conjugate modes is implied 
throughout this paper. Non-SM particles in the loop pro- 
cesses or non-SM coupling mechanisms can significantly 
alter the rate of these decays so that measurements of 
their branching fractions serve as powerful tools to probe 
for the effects of new physics beyond the SM. For exam- 
ple, in the minimal supersymmetric standard model the 
B® — > decay rate is proportional to (tan/3) 6 0- 

Q , where tan (3 is the ratio of the vacuum expectation 
values of the two Higgs fields. The decay rate can be 
enhanced relative to the SM by over two orders of mag- 
nitude at large tan f3 values. The search is also sensitive 
to supersymmetry (SUSY) in cosmologically consistent 
scenarios Other models, such as R-parity violating 

SUSY @, littlest Higgs model with T-parity or mod- 
els with extra dimensions [ll], EH , predict large effects 
independent of the value of tan j3. Substantial negative 
interference effects can suppress the B° s — > /i + branch- 
ing fraction by as much as a factor of three in portions of 
the SUSY parameter space [l3| . In the absence of an ob- 
servation, limits on B(B® — > /i + /j - ) are complementary 
to limits provided by direct searches in constraining the 
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new-physics parameter-space. The status of constraints 
on new physics in a variety of different models and in a 
model independent treatment are discussed in Ref. fl4j . 

Recent results on B{B®(B°) — > n + p~) include limits 



from the ATLAS [l5|, CMS [16j, DO [17J, and CDF [18 1 
experiments. The most sensitive result is from the LHCb 
experiment [l9[ , which reported an excess of B a 
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events and measured B{B° S 
and an upper limit for the B(B a 
of about nine of the SM B° 
CDF result reported B(B° S -> fx 
and sets a two-sided interval at 90% C.L. of 4.6 x 10~ 9 < 
B(B° S /i + /i~) < 3.9 x 10~ 8 and an upper limit of 
B{B° -> < 3.5 x 10~ 8 at 90% C.L. The B° s and B° 

results from all experiments are compatible and indicate 
that there is no strong enhancement in the B® — > /i + 
decay rate. Further measurements of B(B® — > /i + are 
likely to constrain strongly new physics models predicting 
significant deviations from the SM predictions. 

We report on a search for B® — > n + ^~ and B° — > 
decays using the complete Run II data set of pp 
collisions at = 1.96 TeV collected by the upgraded 
Collider Detector at Fermilab (CDF II) and correspond- 
ing to an integrated luminosity of 10 fb _ . Because the 
previous CDF analysis [l8[, using 7 fb _1 of integrated 
luminosity, reported an excess of B® — > ji + signal 
events, the same analysis methodology is applied to the 
full available data set. The sensitivity of the analysis 
reported here is improved with respect to that reported 
in Ref. [l|[ due to the 24% increase in event-sample size. 
All other aspects of the analysis have remained the same. 



II. EXPERIMENTAL SETUP: THE CDF II 
DETECTOR 

The CDF II detector is a general-purpose detector [20l — 
[22l | with cylindrical symmetry (Fig. [I} designed to detect 
products of pp collisions at a center-of-mass energy of 
%fs = 1.96 TeV. A cylindrical coordinate system is used 
to describe particle trajectories. The z axis is defined as 
the direction of the proton beam. Besides the azimuthal 
angle 0, radius relative to the beam line r, and polar 
angle 6, we define a pseudorapidity i] = — In (tan(6>/2)). 
The transverse momentum, pt, represents the compo- 
nent of a particle's momentum in the plane perpendicular 
to the beam axis, pt = psin(9. 

The most important subdetectors for this analysis are 
briefly described below and include the tracking system 
and the muon system. Additional subsystems such as the 
calorimeters and luminosity detector system also play a 
role in the analysis. The calorimeters are used in part 
of the particle identification process, while information 
from the luminosity detector system is used in some of 
the background estimations. A more detailed description 
of the CDF II detector can be found in Ref. (H. 
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FIG. 1: Cutaway isometric view of the CDF II detector. 



A. Tracking system 

The tracking system consists of silicon microstrip 
detectors, a multi-wire open-cell drift chamber, and 
a solenoidal superconducting magnet. The innermost 
tracking system, LOO, is a single-sided silicon microstrip 
system mounted on the beam pipe [23] ], Outside LOO 
is the SVXII detector, with five layers of double-sided 
silicon microstrip sensors [HI- One side of each sensor 
provides azimuthal {rip) information while the opposite 
side provides longitudinal information (rz). The SVXII 
hit resolution is 11 /im, while the impact parameter res- 
olution for charged particles with px > 2.0 GcV/c is 
about 40 /im, which includes a 35 /xm contribution due 
to the size of the pp luminous region. The association 
of SVXII rz hits allows measurement of the z coordinate 
of charged-particle trajectories (tracks) at the pp inter- 
action point with 70 ^m resolution. The combination of 
excellent r<j> and z resolution allows precise determina- 
tion of the three-dimensional spacepoint defined by the 
B®(B°) -4 decay vertex and the rejection of back- 

ground from pairs of random muon-candidates that acci- 
dentally meet the selection requirements (combinatorics). 

Outside the silicon subsystems is the COT [25|, an 
open-cell multi-wire drift chamber divided into eight con- 
centric superlayers. The superlayers themselves are di- 
vided in <p into supercells, each containing 12 sense wires. 
In addition to charged-particle trajectories, the COT also 
measures the ionization dE/dx per unit path- length for 
particle identification. For this analysis the dE/dx infor- 
mation is mainly used to help reject kaons. 

Surrounding the COT is a superconducting solenoidal 
magnet producing a 1.4 T magnetic field parallel to the 
beam axis. The px resolution of the COT, a PT /p T « 
0.15% (GeV/c) -1 [25|, is determined by comparing the 
curvature of inward- and outward-going tracks of cosmic- 
ray events. The absolute momentum scale is determined 
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FIG. 2: Muon system cj> and r\ coverage. 



using J/ip, T, and Z-boson resonances, where the reso- 
nances decay into two muons (26| . 



B. Muon system 

Outside the solenoidal magnet are electromagnetic and 
hadronic calorimeters, which in turn are surrounded by 
the muon systems consisting of multi-layer single-wire 
drift chambers and scintillators. The drift chambers are 
used to reconstruct muon-track segments (stubs) while 
the scintillators are used for timing information to match 
muon candidates to the correct pp collision crossing. The 
r\ and <p coverage of the muon subdetcctors used in this 
analysis is shown in Fig. [5] 

In the central region, the cylindrical central muon 
chambers (CMU) [27| provide coverage up to \r}\ < 0.6. 
Because material corresponding to 5.5 interaction lengths 
lies between the pp luminous region and the CMU, a 
muon must have a minimum px of 1.4 GeV/c (range-out 
threshold) to reach the CMU. The CMU is subdivided 
in 24 wedges in </>, each with four layers of drift cham- 
bers, and lies immediately outside the central hadronic 
calorimeter. 

Beyond the CMU are additional central chambers with 
nearly the same r\ coverage known as the CMP. Addi- 
tional steel absorber with a thickness of 2.3 interaction 
lengths is placed between the CMU and CMP yielding a 



range-out threshold p T of 2.2 GeV/c. The CMP forms 
a box around the cylindrical CMU and is comprised of 
four layers of drift chambers, and a layer of scintillator. 

The CMX [2(| detector extends the muon system cov- 
erage to higher pseudorapidity, 0.6 < \rj\ < 1.0. The 
CMX consists of two arches at each end of the detector, 
with additional upper and lower sets of chambers. The 
CMX consists of eight layers of drift chambers arranged 
in conic sections; the geometry is such that particles can 
traverse six of the eight layers on average. The range-out 
threshold p T for the CMX is about 2.0 GeV/c. 

In this analysis muons are required to have either a 
CMU or CMX stub and have the stub matched to an 
extrapolated track from the tracking system. Further in- 
formation, such as z position and stub angle, is used in 
a multivariate likelihood discriminant for muon identifi- 
cation. Information from the CMP detector is used, if 
available, to identify high-purity muons. The muons are 
paired into either a CMU-CMU (CC) or a CMU-CMX 
(CF) channel. The selection criteria for these two chan- 
nels are discussed in the next section. 



C. Muon trigger 

At the Tevatron pp crossings occurred every 396 ns 
(2.5 MHz) and peak instantaneous luminosities of 4 x 
10 32 cm _2 s~ 1 were achieved, creating an event rate of 
1.7 MHz. An online system of custom hardware boards 
and software algorithms (the "trigger" system) was em- 
ployed to reduce the data rate to about 100 events per 
second, which were recorded to tape for later analysis. 
The CDF trigger system was divided into three consecu- 
tive levels with increasing granularity, sophistication, and 
precision. Trigger level one (LI) was evaluated for every 
pp crossing and used coarse track, calorimeter, and muon- 
stub information to identify potentially interesting events 
that were then passed to trigger level two. Trigger level 
two (L2) used more precise calorimeter and muon-stub 
information to eliminate events poorly reconstructed in 
trigger LI. Accepted events were passed to trigger level 
three (L3), a CPU farm performing full event reconstruc- 
tion and identifying the most interesting events to record 
to tape for later analysis. The data sets used in the 
present analysis were collected with a set of LI, L2, and 
L3 triggers that required a pair of muon candidates. 

In LI, muons were identified by matching a track re- 
constructed in the COT to a muon stub reconstructed in 
one of the muon systems. The track reconstruction was 
performed by a custom-built system [28j that achieved 
a pt resolution of a VT jp\ = 1.7%/(GeV/c). Custom 
electronic boards identified muon stubs in each of the 
CMU, CMX, and CMP systems and performed a coarse 
matching to the LI tracks [2{|. Events were required 
to have two separate CMU-track matches or one CMU- 
track match and one CMX-track match. The particles 
associated to the tracks were required to have opposite 
electric charge. In L2 the track muon-stub matches were 
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confirmed using more sophisticated algorithms and im- 
proved resolutions. In order to remove through-going 
cosmic-ray muons and backgrounds from dijet events that 
generate falsely-identified muon candidates (fake muons), 
only events with a dimuon opening-angle less than 120 
degrees in the plane transverse to the beam line were 
passed to L3. The full event reconstruction employed in 
L3 performed a full track fit and required CMU muon 
candidates to have pt > 1.5 GeV/c, CMX muon can- 
didates to have pr > 2.0 GeV/c, the scalar sum px for 
the two muon candidates to exceed 5 GeV/c, the dimuon 
mass to be less than 6 GeV/c 2 , and the difference in the 
z coordinates of the muon tracks at the point of closest 
approach to the beam line to satisfy |Azo| < 5 cm. At 
the highest instantaneous luminosities, the accept rate of 
this dimuon trigger path was too high and events were 
randomly discarded with a frequency that depended on 
the instantaneous luminosity. This reduction in rate dis- 
carded approximately 10% of the total dimuon candidate 
events. In the full Run II data set collected by CDF, 
822 740 (498 443) events satisfied the CMU-CMU (CMU- 
CMX) trigger path with M^- > 4.669 GeV/c 2 and 
formed the initial data sample for the CC (CF) channel. 



III. MONTE CARLO SIMULATION 

We employ Monte Carlo (MC) simulations of B® — > 
H + p~ and B° — > decays together with a full 

CDF detector simulation to estimate signal efficiencies 
not measurable with data control samples. In addition 
to the B®(B°) —> p + p~ MC sample we also produce a 
sample of B + — >• J/tp K + simulated events for modeling 
cross-checks. A MC sample of B2 — » is generated 

using Pythia [30| and EvtGen [3l| with the underlying 
event modeling tuned to reproduce minimum-bias events 
in pp collisions at y/~s = 1.96 TeV [32] ]. We generate 
simulations of 66 pair production and their subsequent 
hadronization. One of the resulting B hadrons is required 
to be a B® or B° meson that decays to two muons. There 
are no requirements on the second B hadron, which is 
allowed to decay inclusively The MC events are also 
run through a detailed CDF II detector simulation [33[ 
that accounts for resolution and occupancy effects in all 
the subdetector systems. The MC events are required to 
meet all the baseline requirements discussed in Sec. IIV Al 
with the exception of the dE/dx and muon likelihood 
requirements, which are omitted because data-driven es- 
timates of their efficiency are used instead. We weight 
the £?°-meson pt and isolation (cf. Sec. IIV A[) distri- 
butions to match the measured spectra obtained from 
B + — > J/tp K + data and B® — > J/tp </> data, respec- 
tively. The B + pt spectrum is expected to be similar 
to that of the B® for pt > 4.0 GeV/c and provides a 
significantly larger sample size. The isolation spectra, 
however, may differ between B + and B® mesons due to 
the participation of u and s quarks in the hadronization 
processes, respectively. For the B° —> search the 



B°-meson pt and isolation distributions are weighted us- 
ing B+ J/tp K+ data. 

We use simulated B° s — > /J + /i~ decays to estimate 
a mass resolution of 24 MeV/c 2 for events passing the 
baseline and vertex requirements described in Sec. IIV Al 
Comparisons between data and MC using J/tJj — > /x + 
and B + J/tp (— >• /i + /i - ) K + samples reveal a 10% 
discrepancy in mass resolution, which is propagated as a 
systematic uncertainty and negligibly affects the efficien- 
cies discussed in Sec. [V] 



IV. EVENT SELECTION 

This analysis searches for B® — > p, + n~ and B° —> 
decays using the full 10 fb _1 CDF II data set. 
The same analysis methods are used for both decays. The 
signal-search dimuon-mass range is adapted to the differ- 
ent B° and B® pole masses and corresponds to ±2.5 times 
the two-track mass resolution. The branching fractions 
are measured relative to a B + — > J/tp (— > /i + /i~) K + 
normalization mode. This mode, together with directly 
produced J/tp —> /i + p~ decays, are used to estimate sig- 
nal efficiencies and perform consistency checks. Initially 
we apply baseline requirements (described in Sec. IIV Ap 
on all data and MC samples. An artificial neural network 
(NN) classifier is then applied to enhance the expected 
signal-to-background-ratio. In order to avoid inadver- 
tent biases, the data in an extended mass-signal region, 
5.169 < M^ +Jli - < 5.469 GeV/c 2 , are kept hidden until 
all selection criteria are finalized. An unbiased optimiza- 
tion of the analysis is performed using mass-sideband 
data model of the combinatorial background and 
MC events as a model of the peaking backgrounds and 
signal. 

Major background processes include Drell-Yan dimuon 
production (qq — > fi + p~) processes through virtual 7 
and Z-boson states, double scmilcptonic decay (66, cc — > 
/j + /i~A), and sequential scmilcptonic decay (6 — > 
Cfi~X — > sp~ p + X') of 6 and c quarks. A combina- 
tion of a scmilcptonic decay and a fake muon or two 
fake muons from two-body hadronic decays of B hadrons 
(B — > h + h'~ where h and h! are charged pions and 
kaons), can also be a source of background. Fake muons 
are tracks from pions and kaons that have a matching 
muon stub and are falsely identified as muons. The 
backgrounds can be divided into a combinatorial dimuon 
background and a peaking B — > h + h'~ background, 
which arc estimated separately. Backgrounds are stud- 
ied in detail in statistically-independent control samples 
with various baseline requirements inverted or relaxed to 
enhance the background contribution. 

The search for a B a s (B a ) — > n + p~ signal is performed 
in bins of NN output. The NN binning is determined by 
an a priori optimization, discussed in detail in Sec. IVIIII 
that uses the expected B(B a s — > /i + /i~) limit in absence 
of signal as a figure of merit, resulting in eight NN bins. 
Additionally, the signal region is divided into five mass 
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bins centered on the world average B® and B° masses. 
This yields a total of 80 single-bin counting experiments 
corresponding to the CC and CF topologies, each with 
eight NN bins, and five mass bins. 

Once the signal efficiencies and background estimates 
are well understood, a thorough statistical analysis of 
the result is performed. The sections below discuss the 
analysis methodology and results in more detail. 



structed -B-meson candidates must also have \rj\ < 1 and 
Pt > 4.0 GeV/c. We reconstruct the pp interaction point 
for each event by refitting track-helix parameters of all 
charged particles, after excluding the muon candidates, 
that have Zq within 1 cm of the dimuon average zq and 
have Pt > 0.5 GeV/c to a common space point (vertex). 
If the fit fails, the primary vertex is determined by the 
beam line position, estimated using COT+SVXII infor- 
mation, at the average zq of the dimuon pair. 



A. Baseline event selection 

Except where specifically discussed, all samples used in 
this analysis are required to pass a set of baseline require- 
ments that consist of kinematic-, particle-identification-, 
and vertex-related requirements discussed in this section. 

Muon-candidatc tracks are required to be matched 
with a muon identified by the trigger and have pr > 2.0 
GeV/c and p T > 2.2 GeV/c for CMU and CMX muons, 
respectively. Tracks are required to be fiducial to the 
COT, by demanding that the absolute value of the z co- 
ordinate be less than 155 cm at the COT exit radius 
(r = 136 cm). This ensures that tracks traverse the full 
radial extent of the COT. Tracks are also required to have 
r<fi hits in at least three L00+SVXII layers. 

A likelihood method [H| together with a dE/dx based 
selection [35j is used to further suppress contributions 
from hadrons misidentified as muons. The muon likeli- 
hood is based on matching muon stubs to COT tracks 
in the (j) an d z coordinates, energy deposits in the elec- 
tromagnetic and hadronic calorimeters, information con- 
cerning the subsystems in which the muon is identified, 
and kinematic information. We require a muon likeli- 
hood with a value greater than 0.1, which is approxi- 
mately 99% efficient for signal, while rejecting 50% of 
the combinatorial background, which contains a signif- 
icant fraction of hadrons misidentified as muons. The 
B — > h + h'~ decays are efficiently rejected as discussed in 
detail in Sec. IVTHl 

A calibration for the dE/dx measurement is applied to 
ensure stability over the tracker volume and operational 
conditions. This calibration corrects for effects depen- 
dent on instantaneous luminosity, local density of tracks, 
and kinematic information. We make the requirement 

In ^ d^/dx°. ) ^ — 0-83, where In is the natural logarithm, 
dE/dx is the observed dE/dx after the calibration has 
been applied, and dE/dx e is the expected dE/dx esti- 
mated using the observed particle's momentum and the 
muon mass hypothesis. This requirement is nearly 100% 
efficient for muons, while rejecting about 50% of kaons. 
Estimations of the signal efficiency and background re- 
jection for the choice of likelihood and dE/dx selection 
criteria are made using J/ip — > fi + p~ events compared 
to combinatorial background from the dimuon-mass side- 
bands and kaons from B + —> J/ip K + decays. 

Muon pairs are required to have an invariant mass 
in the range 4.669 < M^+p- < 5.969 GeV/c 2 . Rccon- 



In addition to the above requirements, we fit the 
two muon tracks to a common secondary vertex. Sev- 
eral demands on secondary-vertex-related variables are 
made. We define a three-dimensional displacement 
length L 3 n = p(B) ■ Xb/\P(B)\, where p(B) is the B- 
candidate-momentum vector estimated as the vector sum 
of the muon momenta and xb is the secondary- vertex 
position vector determined relative to the primary- 
vertex position. We estimate a proper decay time t = 
LsjjMn+u,- /\P\B)\, where M^+ M - is the dimuon invari- 
ant mass, which in turn is used to define the proper 
decay-length A = ct. The baseline requirements demand 
that the measured proper decay-length of the B candi- 
date, with its uncertainty, <j\, satisfy < A < 0.3 cm and 
\/<j\ > 2; the secondary vertex fit xV-^dof, where iVdof 
is the number of degrees of freedom, must be less than 
15; the three-dimensional displacement length and its un- 
certainty <jl 3D satisfy L^u < 1-0 cm and <jl 3D < 0.015 
cm; the three-dimensional opening angle between xb and 
p(B), Aft, satisfy Af2 < 0.7 rad; and the 5-candidatc 
track-isolation, I, satisfy I > 0.50. The isolation is de- 
fined as / = pr(B) /(pt{B) + ^^pr(i)), where the sum 

i 

goes over all charged particles with px > 500 MeV/c and 
within an rj(j) cone centered around the £?-mcson momen- 
tum with radius R = \J{Aij) 2 + (A<p) 2 < 1.0. The trig- 
ger and baseline requirements result in a total of 60 842 
CC and 64495 CF muon pairs, shown in Fig. [3J 



For the final selection, we define search regions around 
the known B® and B° masses (3(|. These regions corre- 
spond to approximately ±2.5<r m , where a m sa 24 MeV/c 2 
is the two-track mass resolution estimated from B® — > 
MC events satisfying the trigger and baseline re- 
quirements. The sideband regions 5.009 < M M + M - < 
5.169 GeV/c 2 and 5.469 < M M+jU - < 5.969 GeV/c 2 are 
used to estimate combinatorial backgrounds. Dimuon 
candidates with mass smaller than M„+„- = 5.009 
GeV/c 2 are not used for background estimations due to 
b — > /j, + p~X [37} contributions but are used for the NN 
training. It was verified that inclusion of these candidates 
does not significantly affect the discriminating capabili- 
ties of the NN. Backgrounds from B —> h + h'~ decays, 
which peak in the signal mass region, are estimated sep- 
arately. 



8 



™ o 7000 




FIG. 3: Dimuon invariant mass distribution for events sat- 
isfying the baseline requirements for the B® — > fi + /i~ and 
B — > search sample with linear fit overlaid for the CC 

(top) and CF (bottom) channels. 



B. Normalization decay mode 

A sample of B + — > J/ip K + events serves as a nor- 
malization decay mode. The B + — > J/ip K + sample is 
collected using the same dimuon triggers and selection 
requirements as used for the signal sample so that com- 
mon systematic uncertainties are suppressed. The kaon 
candidate must satisfy the same COT and L00+SVXII 
requirements as the muon candidates and must have 
Pt > 1 GeV/c, a regime for which the COT track- 
ing efficiency is well understood. For each kaon can- 
didate the pT , and K + tracks are constrained to 
originate from a common vertex in three dimensions. 
The \ 2 probability of the vertex fit is required to be 
greater than 10~ 5 . Additionally, the dimuon invariant 
mass is required to be consistent with the world av- 
erage J/ip mass (3.017 < M^- < 3.177 GcV/c 2 ). 
The [i + fi~K + mass distribution of candidates satisfying 
these criteria is shown in Fig. [4] A signal mass region, 
\M ft +^- K + - M B+ \ < 35 MeV/c 2 , is used together with 
mass sidebands to estimate a B + — > J/ip K + yield of 
28 081±219 and 12 144±153 in the CC and CF channels, 
respectively, using simple sideband subtraction. These 
yields include a 0.14% correction for B + — > J/ipn + con- 
tributions. The correction factor is determined by com- 
paring the relative geometric acceptance, reconstruction 
efficiency, mass window efficiency, and branching frac- 




FIG. 4: The fi fi~ K invariant mass distribution for events 
satisfying the requirements for the B + — > J/ip K + sample 
with fit overlaid for the CC (top) and CF (bottom) channels. 



tion of B+ -> J/ip K+ and B+ -> J/ipn+ decays. The 
uncertainties on the B + — s- J/ip K + yields are due to 
the limited size of the fi + K + sample. The shape of 
the n + fi~K + mass distribution is parametrized using a 
Gaussian summed with a first-order polynomial and a fit 
to the data yields a Gaussian mean of 5280 ± 11 MeV/c 2 
and 5274 ± 11 MeV/c 2 for CC and CF, respectively, con- 
sistent with the world average B° mass. 



C. The neural network discriminator 

We search for B® — > (B° — > decays 

in a narrow 120 MeV/c 2 mass window centered around 
the world average B° s (B°) mass. After application of 
the baseline selection criteria the data sample is dom- 
inated by combinatorial background and has a signal- 
to-background ratio of approximately 10~ 4 . We signifi- 
cantly improve the discrimination power between signal 
and combinatoric background by combining kinematic, 
isolation-related, and lifetime-related variables using a 
neural- network classifier. A key feature of the NN is that 
it is designed to be independent of the dimuon mass. This 
allows an estimation of the combinatorial background in 
the signal mass region by interpolation from mass side- 
bands after the NN has been applied to the data. 
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Fourteen variables describing the measured 
kinematics-, isolation-, and lifetime- related proper- 
ties of the signal are selected to construct a NN 
discriminant vm- These variables are selected based 
on a study of the physics characteristics of the signal 
and background using training samples described be- 
low. Variables that are poorly modeled by the MC or 
correlated to dimuon mass are excluded from the NN. 
The signal and background samples are separated into 
CC and CF data sets that are exclusively independent. 
Trainings arc performed separately using the same input 
variables for the CC and CF data sets. 

We chose the neurobayes NN package [H, |39[ for the 
construction of the NN. Using this multivariate analysis 
technique, we achieve a background rejection of 99.9%, 
while maintaining a signal efficiency of 40%. 



1. Background training sample 

We define two dimuon mass sidebands, 4.669 < 
M p+/1 - < 5.169 GeV/c 2 (lower SB) and 5.469 < 
Mfi+fi- < 5.969 GcV/c 2 (upper SB), which arc used 
to construct a combinatorial background sample used as 
one of the NN training samples. Although this analy- 
sis is based on 10 fb - , the NN optimization was done, 
a priori, during the previously published analysis based 
on 7 fb" 1 |3. A total of 36 329 CC and 39657 CF 
mass-sideband events survive the baseline requirements 
in 7 fb -1 of integrated luminosity and are used as the 
background sample for the NN training and testing. 



2. Signal training sample 

The simulated B° s — > /-t + /i~ signal sample used for 
the NN training is described in Sect. IIIII The dimuon 
mass range used for this signal sample corresponds to 
the search region defined in Sec. IIV Al A sub-sample of 
MC signal events equal in size to the sum of the back- 
ground samples is randomly chosen for NN training while 
the full MC sample is used to estimate the efficiency of 
the neural network. 



3. Input parameters 

Candidates from combinatorial backgrounds, when 
compared to signal events, tend to have smaller dimuon 
mass, shorter proper-decay-lengths, a softer pt spectrum, 
and a higher density of tracks near the B® candidate. We 
investigated a variety of kinematic, isolation, and life- 
time variables as inputs to the NN. After removing vari- 
ables that were poorly modeled or were found to cause 
a correlation between vn and the dimuon mass, an ini- 
tial set of 20 discriminating variables remained. Multiple 
neural networks were trained varying the number and 



combination of input parameters employed. The per- 
formances of the trained networks were compared in the 
plane of combinatorial-background acceptance versus sig- 
nal acceptance. A NN using 14 input variables was found 
to offer excellent background discrimination and is em- 
ployed as the final discriminant in this analysis. The 14 
variables employed are listed here in order of descending 
discrimination power between signal and background: 

Af2: Three-dimensional angle between the B® momen- 
tum and the vector pointing from the primary to 
secondary vertex. 

I: Isolation of the B® candidate as defined in Sec. IIV Al 

Larger do(fi): For the muon pair, the impact parameter 
of the muon with the larger value. 

do(B°): Impact parameter of B® candidate with respect 
to the primary event vertex. 

Lt/clt- Significance of the transverse decay length 
Lt, where Lt = Pr{B) ■ x*t/\pt(B)\ and xt is 
the secondary- vertex-position vector relative to the 
primary-vertex position in the plane transverse to 
the beam line. 

X 2 '- X 2 P er degree of freedom of the secondary- vertex fit. 

L^£>: Three-dimensional vertex displacement as defined 
in Sec. lIVAl 

Lower px For the muon pair, the transverse mo- 
mentum of the muon with the lower value. 

Significance of smaller do(fi): do(/j,) / cr^^ of the 
muon with smaller impact parameter, where o~d {^) 
is the estimated uncertainty of da(n). 

X/cry. Significance of A. 

Smaller |do(/i)|: For the muon pair, the impact param- 
eter of the muon with the smaller value. 

A: Three-dimensional proper decay length defined in 
Scc. HVAl 

AJ1t= Angle between the B® momentum and the vector 
pointing from the primary to secondary vertex in 
the plane transverse to the beam line. 

Significance of larger do(fi): do(f^) / o~d (fi) of the 
muon with larger impact parameter. 

When available, the silicon rz tracking information 
strongly discriminates against combinatorial and par- 
tially reconstructed backgrounds. Tracks from combi- 
natorial background are less likely to originate from a 
common vertex in three dimensions and arc suppressed 
by the vertex-fit quality information used in the multi- 
variate discriminate. Similarly, precise three-dimensional 
reconstruction of the primary and secondary vertices al- 
lows the comparison of the -B-candidate flight direction 
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FIG. 5: Comparison of the combinatorial background distri- 
bution (solid), taken from dimuon mass sideband events in 
the first 7 fb -1 of data, to the signal distribution (dashed), 
taken from MC events, for the six most discriminating of the 
NN input variables. 



and the secondary- vertex vector, which rejects both com- 
binatorial and partially-reconstructed background. Com- 
parisons of the distributions of background and signal 
samples for the input variables with the greatest discrim- 
inating power are shown in Fig. [5J 



4- Discriminant output 

When training the NN, 80% of each training sample is 
used for the actual training, while the remaining 20% is 
used for validation and over-training tests. The trained 
NN takes the input parameters for every event and re- 
turns an output value vm m the range [0 (background- 
like), 1 (signal-like)]. The combined CC and CF vm dis- 
tribution is shown in Fig. [6] for background and signal 
separately. For various vn requirements, the resulting 
signal and background efficiencies are given in Table HI 



D. Neural network consistency checks 

We perform several consistency checks of the NN, in- 
cluding tests for M^+^--vn correlations, tests of over- 



CC+CF sideband sample 
CC+CF signal MC sample 




0.2 0.4 0.6 0.8 1 
NN output 



FIG. 6: Distributions of un for signal and background sam- 
ples. The background sample consists of dimuon mass side- 
band events from the first 7 fb" 1 of data. 



training, and studies of MC mis-modeling of the sig- 
nal distribution. 



1. Check for correlations between vn and dimuon mass 

We estimate the dominant combinatorial background 
in the dimuon mass signal region by linear interpolation 
from the dimuon mass sideband region. An unbiased esti- 
mate of the resulting combinatorial background requires 
that vn is independent of the dimuon mass. We per- 
form several studies of the M^+^-vjy dependence. For 
example, we divide the sideband sample into an "inner" 
region and an "outer" region according to dimuon mass. 



The inner region is defined as 5.002 < M„+„- < 5.169 
GeV/c 2 and 5.496 < M M+M - < 5.636 GeV/c 2 , and is 
used as a "signal" sample. The outer sample is formed 
by the remaining events in the sideband, and is treated 
as a "background" sample. Using these "inner signal" 
and "outer background" samples, the NN training is re- 
peated with the same set of 14 input variables listed in 
Sec. IIV C 31 and the resulting z/jy distributions are com- 
pared. This check is based on the observation that event 
properties are nearly identical for events in the inner and 
outer regions and thus only differ by dimuon mass. The 
resulting distributions for the inner and outer samples 
are compared in Fig. [7] for the CC and CF channels sepa- 
rately. No significant difference in the vjq distribution is 
found, indicating that vjq is independent of dimuon mass. 
This is strong evidence that the NN cannot use the 14 
input variables to infer anything about the dimuon mass 
and that M„+„- and vjq are uncorrelated. 

As a further investigation of Af„+„--i/jv dependen- 
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TABLE I: Efficiency for various vn requirements, prior to MC weighting, signal (Sig. 
CC and CF trainings separately. Uncertainties include only the statistical component. 
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FIG. 7: Comparison of the vn distribution of inner sideband events to outer sideband events using the full 10 fb" 1 of data 
and a custom-trained NN as described in Sec. IIV D II The CC and CF channels are shown separately. 



cies, we study the average value of vjy as a function 
of the dimuon mass. We use both the signal-sideband 
data, with two oppositely-charged muons and A > 
(OS+), and a control sample consisting of events with 
two oppositely-charged muons but with proper-decay 
length A < (OS—). The OS— control sample is dis- 
cussed further in Sec. IVICl and is dominated by prompt 
combinatorial background. The resulting distributions, 
shown in Fig. [8j are consistent with a flat line, again 
indicating that M^+p- and z/jv arc independent. 

To check for correlations of vn with small variations 
of mass - of the order of the mass difference between B° 
and B°- a separate B° MC sample is produced. The re- 
sulting vn distribution is compared with that obtained 
from the B® MC sample in Fig. [9] No significant differ- 
ence between the B° and B® distributions is found, 



again indicating the M^^- and i/jv are independent. 

We conclude that the choice of discriminating variables 
yields a NN classifier with excellent signal-to-background 
discrimination, while remaining independent of M /J + /i - 
and leaving the shape of the dimuon mass distribution 
unchanged. 



2. Check for NN over-training 

A portion of the background and signal training sam- 
ples is set aside for internal validation tests. These tests 
include metrics provided by the neurobayes package 
that are sensitive to over-training. All the metrics show 
that no over-training occurred for any of the neural nets 



12 




5.2 5.4 5.6 5.8 



M^-[GeV/c1 



S 1 

Q. 
-i— ' 

Z3 

o 

^ 0.9 



0.8 



0.7 



os- 

V NN> - 7 



5.2 5.4 



M 



5.6 5.8 
[GeV/c 2 ] 




0.9 1 
NN Output 



FIG. 9: Comparison of the un distribution for simulated sam- 
ples of B® and B° signal events. The distributions have been 
normalized to the same area over the entire NN output range, 

< U N < 1. 
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FIG. 8: Result of the ^--vn correlation check using 
the first 7 fb _1 of data. Top: correlation between average 
vn and dimuon mass in the OS+ sample for which we keep 
the — > fi + fj,~ + B° — > fi + fi~ signal region blinded. Bot- 
tom: correlation between average and dimuon mass in 
background-dominated OS— sample. 
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FIG. 10: The vn distributions for networkks trained with 
33%, 50%, and 80% of the input background sample taken 
from the dimuon mass sideband events reconstructed in 7 fb~ 
of data. 



used in this analysis. As a further test of possible over- 
training, we repeat the NN optimization using 33% and 
50% of the input background sample in the training. The 
resulting vn distribution for sideband events is compared 
among these two trainings and the default training in 
Fig. [10] and no significant differences are observed. Wc 
conclude there is no evidence of NN over-training. 



3. Validation of the discriminant distributions 

We check the MC modeling of the 14 input variables 
and additional kinematic and lifetime variables using 
the sample of B + — > J/ipK + events. Distributions 
from B + — > J/ip K + data and MC are compared in 
Figs. [TTJ and Q2J in order to better mimic the resolutions 
relevant for B° — > decays, the vertex variables use 

only the two muons from the J/ tp of the B + J/ip K + 
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decay while the B-hadron momentum variables, p{B) 
and pt(B), and isolation variables use the three-track 
information. The i>n distribution obtained from the 
B + —> J/ip K + MC is compared to that obtained us- 
ing the sideband-subtracted B + — >• J / ip K + data in Fig- 
ures [13] and [14] for the CC and CF channels, respectively. 
The small discrepancies observed are used to assign sys- 
tematic uncertainties as discussed in Sec. IV D[ 



V. DETERMINING THE B° -> n+fi~ 
BRANCHING FRACTION 

The branching fraction B(B° S (B°) — > is deter- 

mined using Eq. ([1]), where a,B°m°) is the geometrical 
and kinematic acceptance of the triggers employed to 
collect the dimuon data set; e^o^o) is the trigger effi- 
ciency for Bg(B°) hadrons decaying to within the 
acceptance; e^ B0 ) is the efficiency of the reconstruc- 
tion, baseline, and mass requirements for pairs 
satisfying the trigger requirements; and e^fs ) ^ s the 



efficiency of the NN selection for events satisfying the 
trigger and baseline requirements and a given set of 
requirements. The equivalent efficiencies and acceptance 
for the normalization mode arc indicated with the B + 
subscript. No term appears since the NN is not ap- 
plied to the normalization mode. The B^(B°) 
and B + — > J/ip K + acceptance and efficiencies are es- 
timated separately to account for kinematic differences 
arising from the differing B-hadron decays. The 6-quark 
fragmentation-fraction-ratio / M // s , the relevant product 
branching fractions for the normalization mode B(B + — > 
J/i[> K + —» /i + \x~ K + ) , and their uncertainties are taken 
from Ref. [36| • For the B° search, f s is replaced by fd and 
the fragmentation ratio is set to unity. Due to their dif- 
fering sensitivities, the CC and CF channels are treated 
separately and then combined to yield the final result. 
Normalizing the observed signal rate to the rate of an 
abundant, well-known, and kincmatically-similar decay 
results in a significant reduction to the total uncertainty 
since systematic effects largely cancel in the acceptance 
and efficiency ratios of Eq. ([1} . 
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We define the single-event-sensitivity (SES) as the 
branching fraction determined from Eq. ([T]) when setting 
Nb°(b°) = !■ The SES approximates the smallest sig- 
nal branching-fraction to which the analysis is sensitive. 
The methods used to estimate the inputs to Eq. ([1]) are 
described below and the results are summarized in Ta- 
ble [II] Combining the CC and CF channels yields a SES 
for the B° -> fi + pT search of 1.4 x 10~ 9 with an 18% to- 
tal uncertainty. The combined SES for the B° —> /i + pi~ 
search is a factor of 3.5 smaller with a reduced uncer- 
tainty of about 12% since the fragmentation ratio fd/ fu 
is not relevant. 



A. Acceptance 

The acceptances are determined using B° s — > fJ, + [i~ 
and B+ -> J/ip K+ MC simulations for B° s and B+ 
mesons that satisfy \y\ < 1.0 and pr(B^) > 4 GeV/c, 

where y = g In ^ e-^' J i s the rapidity. Both muons must 
satisfy the fiducial and kinematic requirements of the 
trigger discussed in Sec. Ill Cl Muons are required to have 
p T > 2.0 GeV/c if detected in the CMU, and p T > 2.2 
GeV/c if detected in the CMX and their trajectories must 
extrapolate to the active fiducial- volumes of the muon, 
COT, and L00+SVXII systems. The kaon in the normal- 
ization channel must have px > 1-0 GeV/c and its tra- 



jectory must extrapolate to the active fiducial-volumes of 
the COT and L00+SVXII systems. Effects from COT- 
track reconstruction, multiple scattering, and stub-track 
matching are included in the reconstruction efficiencies 
discussed below. Systematic uncertainties on the accep- 
tances are assessed by varying the 6-quark mass, fragmen- 
tation modeling, and renormalization and factorization 
scales in the MC generation by one standard-deviation- 
uncertainty and quantifying the resulting change in the 
acceptance ratio. Additionally, we assign variations of 
the acceptance due to changes in the size of the pp lu- 
minous region as a systematic uncertainty. The observed 
differences are added in quadrature. The final acceptance 
ratio including statistical and systematic uncertainties is 
given in Table im for the CC and CF channels separately. 



B. Trigger efficiencies 

The trigger efficiency is measured separately for the 
LI, L2, and L3 dimuon triggers using data control sam- 
ples collected using unbiased trigger selections identify- 
ing a high-purity sample of dimuon events that satisfy 
the criteria of the trigger under study. The efficiency is 
determined from the fraction of events in this sample for 
which also the trigger under study fires. Dimuon data 
collected requiring only trigger LI are used to measure 
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FIG. 11: A comparison of B + — > J/ip K + sideband-subtracted data to simulated B + — > J/ip K + events for a variety of 
kinematic and lifetime-related distributions, including some used as input to the neural net. 



the L2 and L3 efficiencies, which exceed 99%. The LI 
dimuon trigger efficiency is determined as the product of 
the LI efficiency for each muon separately. The single- 
muon LI efficiencies are measured using a tag-and-probe 
method on samples of high-purity J/ip — > events 
collected with a trigger LI that requires only one muon. 
The muon firing the LI trigger used to collect the J/ip 
control sample is identified as the "tag" , while the second 
muon from the J/ip decay is unbiased with respect to the 
trigger whose efficiency we are studying and can be used 
a "probe". Dimuon mass-sideband subtraction is used 
to remove effects of the small background present in the 
sample. The single- muon LI efficiencies are parametrized 
as a function of the date the data were recorded and the 
track pt, \rf\, and <fi for CMU and CMX muons sepa- 
rately. The paramctrization by date accounts for sig- 
nificant changes in detector operating conditions aris- 
ing from variations in trigger configuration, COT perfor- 
mance, or Tevatron beam parameters. The parametriza- 
tion in px describes the rapidly changing LI efficiency 
near the trigger px threshold. The parametrization in 
1 77 1 describes changes in the LI efficiency due to the in- 



creased ionization path-lengths of tracks traversing the 
COT at large \rj\, which may increase the probability for 
the corresponding hits to exceed the noise threshold and 
fire the trigger. The parametrization in <p is primarily 
important for a small amount (0.2 fb _1 ) of early data 
for which the COT gain was temporarily degraded in 
the bottom portion of the chamber [4ol |. The resulting 
singlc-muon LI efficiency is about 96% for the muons 
relevant for this analysis and plateaus at about 99% for 
muons with px > 5 GcV/c. The dimuon LI efficiency is 
estimated by convolution of the single-muon efficiencies 
with the (p T , 1 77^ |, (f>^ , p T , I, 4^ ) distribution 
obtained from B° and B+ ~* J/4 K+ M C for 

events in the geometrical and kinematic acceptance of the 
trigger. This convolution yields a dimuon LI efficiency of 
about 93% for both the B° s and B+ J/ip K + 

samples in each of the CC and CF channels. The total 
systematic uncertainty for the trigger LI efficiency is in 
the 1-2% range and is dominated by variations of the 
singlc-muon efficiency as a function of the isolation of 
the muon and by differences between muons detected in 
the 77 > and 77 < volumes of the detector. Double- 
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FIG. 12: A comparison of B + — > J/tp K + sideband-subtracted data to simulated B + — > J/ip K + event for several variables 
used as input to the neural net. 



muon correlations are studied and found to be negligible. 
When propagating the single-muon uncertainties to the 
dimuon efficiency, the uncertainties are taken to be 100% 
correlated between the two muons. 

The total trigger efficiency is determined as the prod- 
uct of the LI, L2, and L3 trigger efficiencies for the 
— > and B + — > J/ip K + modes separately. 

The total uncertainty of the efficiency ratio is much less 
than 0.01 and is estimated by treating the B® — > [i + [i~ 
and B + — > J/tjj K + total uncertainties as 100% corre- 
lated. The final trigger efficiency ratios for the CC and 
CF channels and their associated uncertainties arc shown 
in Table HU 



C. Reconstruction efficiencies 

The total reconstruction efficiency is factorized into 
the efficiency for COT-track reconstruction, muon-stub 
reconstruction, association of L00+SVXII hits to the 
COT track, the dimuon-vertex reconstruction, and the 
dimuon-mass reconstruction efficiencies. For the normal- 



ization mode, the kaon COT, the kaon silicon, and the 
B + vertex reconstruction efficiencies arc also evaluated. 



1. COT track reconstruction efficiency 

The probability for identifying a single track in the 
COT is evaluated by embedding MC tracks into real data 
events and measuring the fraction of those tracks that are 
successfully reconstructed and satisfy the CDF standard- 
track-quality requirements. Hits from MC charged- 
particle tracks are inserted into events from beam data. 
In readout channels where MC hits overlap with hits in 
the event record, the hits are merged. The MC is tuned 
so that quantities such as deposited charge per hit and 
single- hit position resolution match those observed in the 
data. The dependencies of these quantities on various pa- 
rameters, such as the track ij, the electric charge of the 
track, and the track isolation, are also accurately repro- 
duced in the MC. For charged particles fully fiducial to 
the COT and with p T > 1.5 GcV/c, the COT recon- 
struction is consistent with being fully efficient. System- 
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TABLE II: A summary of the inputs used in Eq. ([T]) to determine B(B® — > fi + fj,~) in the CC and CF channels separately. 
The relative uncertainties are given parenthetically. The uncertainties for the trigger efficiency ratios are significantly smaller 
than 1% and are denoted by 0.00. The single-event-sensitivities (SES) for vn > 0.70 are given in the last row. Combining 
CC and CF gives a SES for the B° -> /i + fj,~ search of 1.4 x 10~ 9 (±18%). The combined SES for the B° -> /i + pT search is 
3.9 x 10" 10 (±12%). 



CC CF 

a B + /ago 0.31 ±0.02 (±6%) 0.20 ±0.01 (±7%) 

e^/e%f 1.00 ±0.00 (-) 0.98 ±0.00 (-) 

e^ /egg° 0.85 ±0.06 (±8%) 0.84 ±0.06 (±9%) 

egoi^N > 6.70) 0.92 ±0.04 (±4%) 0.86 ±0.04 (±4%) 

e^^jv > 0.995) 0.46 ±0.02 (±5%) 0.47 ±0.02 (±8%) 

N B+ 28081 ±219 (±1%) 12144 ±153 (±1%) 

fu/fs 3.55 ±0.47 (±13%) 3.55 ±0.47 (±13%) 

B{B+ -> J /if; (-> K+) (6.01 ± 0.21) x 10" 5 (±4%) 

SES {v N > 0.70) 2.2 x 10~ 9 (±18%) 3.3 x 10~ 9 (±18%) 
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FIG. 13: Comparison of the vn distribution for B + — > 
J I tp K + sideband-subtracted data to that obtained using sim- 
ulated B + — > J/ip K + events for the CC and CF channels 
separately. The MC is normalized to the observed number 
data events. 



FIG. 14: Comparison of the vn > 0.95 distribution for 
B + — > J/tp K + sideband-subtracted data to that obtained 
using simulated B + — > J/ip K + events for the CC and CF 
channels separately. The MC is normalized to the observed 
number of data events over the range < vn < 1. 



atic uncertainties include variations of the MC parame- 
ters affecting the COT hit distributions and account for 
small (< 1%) observed variations as a function of track 
isolation, pr, and rj. The dimuon efficiency is taken to be 
the square of the single-track efficiency and the resulting 
dimuon efficiency ratio is 1.00±0.01 for both the CC and 
CF channels. 



2. Muon stub reconstruction efficiency 

The muon-stub reconstruction efficiencies arc deter- 
mined from J /ip — > events using a tag-and-probe 
method similar to that described in Sec. IV Bl The "tag" 
muon candidate is required to have a COT track satisfy- 
ing the relevant trigger and COT baseline requirements of 
Sec. IIV A1 matched to a muon stub, while the other muon 
is only required to pass the COT baseline requirements. 
The muon-stub reconstruction efficiency is determined 
using the fraction of events for which the second muon 
is also matched to a muon stub. The efficiency is esti- 
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mated both with data and MC samples and the efficiency 
measured with data is divided by the efficiency measured 
with MC to account for geometric losses already included 
in the acceptance estimate. As a consistency check of 
the efficiency estimates, the same procedure is used to 
determine the efficiency of high-p^ muons using a sam- 
ple of high-purity Z° — > fi + fi~ decays. The difference in 
efficiencies is taken as the systematic uncertainty. The 
ratio of dimuon efficiencies for both the CC and CF is 
1.00 ±0.03. 



3. Muon ID efficiency 

The combined efficiency of the dE/dx and muon- 
likelihood requirements is determined using a J/ip — > 
sample by comparing the signal yield with and 
without the application of these muon identification cri- 
teria. The single-muon efficiency is determined as a 
function of px- The total dimuon efficiency is eval- 
uated by convolution of the (pt^ , p 1 ^ ) distribution 
from B a s and B + J/ip K + MC for events 

within the acceptance and satisfying the trigger, COT, 
and muon-stub reconstruction requirements. This effi- 
ciency is cross-checked in situ using B + — > J/ip K + 
data events. The difference between the efficiency in the 
B+ J/ip K + MC events and in the B+ J/ip K + 
sideband-subtracted data is assigned as a systematic un- 
certainty. The final dimuon efficiency ratio for the muon 
identification requirements is 1.01 ±0.03 for both the CC 
channel and CF channels. 



4- L00+SVXII association efficiency 

The efficiency of associating LOO and SVXII hits to 
muon tracks reconstructed in the COT is estimated us- 
ing J/ip — > events in a manner similar to that 
described in Sec. IV C 21 The J/ip sample includes only 
muons reconstructed as a track in the COT matched to a 
muon stub in the CMU or CMX and surviving the muon 
identification requirements. The efficiency for associating 
at least three rep hits from the L00+SVXII silicon layers 
is evaluated as the fraction of this sample for which the 
muon tracks satisfy that criteria. The variation of the 
efficiency on track px , isolation, and azimuthal dimuon 
opening angle is used to assign systematic uncertainties. 
The resulting dimuon efficiency ratio is 1.00 ± 0.03 for 
both the CC and CF channels. 



5. Dimuon vertex efficiency 

The efficiency of the dimuon- vertex requirements spec- 
ified in Sec. IIV Al is estimated using simulated B + — > 
J/ip K + and B® — »• samples. The resulting effi- 

ciency is found to be consistent with the efficiency deter- 
mined using sideband-subtracted B + — >• J/ip K + data 



events, thus verifying the accuracy of the MC modeling. 
The resulting ratio of efficiencies for the dimuon-vertex 
requirements is 0.99 ±0.01 for the CC and CF channels. 

6. Dimuon mass efficiency 

The efficiency of the dimuon-mass requirements is esti- 
mated using simulated B° — > and B + — > J/ip K + 
events surviving the baseline, vertex, and trigger require- 
ments. Comparisons of the mean and width of the in- 
variant mass distribution using data and MC samples of 
J/tp — > yU + /i~ and B + — > J/ip (— > K + events re- 
veal discrepancies at the 10% level for the width, which 
are used to assign systematic uncertainties. Since the 
signal-search mass-windows correspond to ±2.5er m and 
have high efficiency, the systematic uncertainties negligi- 
bly affect the efficiency ratio, which is 1.00. 

7. B + — > J/ip K + reconstruction efficiency 

In addition to the dimuon efficiencies discussed above, 
the total reconstruction efficiency for the B + — > J/ip K + 
events also includes the efficiency for reconstructing the 
kaon as a COT track, for associating L00±SVXII hits 
to the COT track, and for the n + K + vertex require- 
ments. The kaon COT efficiency is estimated using the 
method of Sec. IV C II and is 0.964 ± 0.016 for both the 
CC and CF channels. This value is lower than that 
for muons due to inefficiencies resulting from kaon in- 
teraction with matter and due to the lower transverse- 
momentum threshold (plf > 1.0 GeV/c) employed for 
the kaons. The uncertainty includes variations of the rel- 
evant kaon-matter interaction cross sections; of the ma- 
terial modeling in the MC; and efficiencies as a function 
of kaon isolation, pt, and r\. The efficiency for associ- 
ating at least three rep hits from L00+SVXII to a good 
kaon COT track is evaluated using B + — s- J/ip K + data 
events. The B + — > J/ip K + signal yield is compared be- 
fore and after applying the L00±SVXII requirements on 
the kaon track and the ratio is used as a measure of the 
efficiency. The pair in the events must satisfy all 

the relevant dimuon baseline requirements of Sec. IIV Al 
including the silicon requirements discussed in Sec. lVC~4l 
and the dimuon vertex requirements of Sec. IV C 51 The 
resulting efficiencies are 0.942±0.002 and 0.948±0.003 for 
the CC and CF channels, respectively. The efficiency of 
the baseline requirements relevant for the /i + \i~ K + ver- 
tex is also directly determined using sideband-subtracted 
B+ J/ip K+ data and is 0.938±0.006 and 0.919±0.010 
for the CC and CF channels, respectively. 

8. Total reconstruction efficiency 

The total dimuon-reconstruction efficiency-ratio is the 
product over all the dimuon efficiency ratios described 
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above. The final efficiency ratio relevant for Eq. (fT} is 
obtained by including the product of the kaon reconstruc- 
tion efficiencies and the B + — ► J/ip K + vertex efficiency. 
The resulting ratios for the CC and CF channel and their 
associated total uncertainties are shown in Tabic [TTJ 



D. Efficiency of the NN selection 

The final NN selection criteria divide the surviving 
events into eight bins in z/jy The bin boundaries are 
determined in an optimization described in Sec. IVIIII 
The efficiency associated with each i/jv bin is estimated 
from simulated B® — > events meeting all other se- 

lection criteria. In Table [II] the efficiency summed over 
all eight bins (un > 0.70) and for the highest bin alone 
(fjv > 0.995) are shown for the CC and CF channels. For 
the likelihood fits described in Sec. IVIIII the efficiency 
given in Table IIIII is used in each bin separately. The NN 
efficiency determined from the B° — > MC sample 

is consistent with the results of Table IIIII Recall that the 
NN is not applied in selecting the B + —> J/ip K + sample 
relevant for the normalization in Eq. (TT]). 

The MC modeling of the NN efficiency is checked by 
comparing the signal efficiencies for B + — > J/tj} K + de- 
cays reconstructed in data and MC using the B + — > 
J ftp K+ MC sample described in Sec. IIV D 31 The com- 
parisons are shown in Figs. [TBI and [T4l and are quantified 
in Table IIVI Some difference in performance is observed 
and likely arises from radiation-damage-induced silicon- 
sensor degradation that is not completely simulated. For 
most NN bins the difference between MC and sideband- 
subtracted data does not exceed 2.5 times the associated 
statistical uncertainty. The most significant deviation oc- 
curs in the vn > 0.995 bin and is 3.4% and 7.0% for the 
CC and CF, respectively. These differences are assigned 
as systematic uncertainties to this bin. 

The distributions of simulated B + — > J/tp K + and 
B® —> events are weighted to match the pr and 

isolation distributions from B + —¥ J/tp K + and B® — > 
J/tp (f> data, respectively. A systematic uncertainty of 
4% is assigned as determined by varying the observed 
-B-meson px and isolation distributions within their sta- 
tistical uncertainties, repeating the weighting, and quan- 
tifying the resulting changes in the NN efficiencies. 



E. Standard model signal expectations 

The expected SM B® — > signal yield for each NN 

bin is given in Table [V] and is estimated using Eq. (TTJ) , 
the SM value of £>(£?!? -> /i + yU~) 0, the quantities from 
Table HI and the NN efficiencies of Table [TTTj to solve for 
Ngo. Combining all NN bins, approximately 1.4 and 1.0 
SM B® —> events are expected in the CC and CF 

channels, respectively. The expected SM B° —>■ 
yield is a factor of (f s /f u ) x (B(B° -> fi+^-)/B(B° -> 
fi + /j,~)) w 9 smaller. 



VI. BACKGROUND ESTIMATION 

The background falls into two classes. The domi- 
nant source of background in the B® — > search 
comes from accidental combinations of muon candi- 
dates that meet the selection requirements (combinato- 
rial background). In addition, a peaking background 
from B — > h + h'~ decays, where h and b! are either 
a pion or kaon, contributes. These two-body charm- 
less B decays are a more significant background for the 
B° —> search due to the downward shift in M^+p- 

causcd by assuming the muon mass for both charged par- 
ticles. The two classes of backgrounds are estimated for 
each (i/jv, M^+u- ) bin for the CC and CF channels sep- 
arately. 

A. Combinatorial backgrounds 

The combinatorial background is estimated using data 
events in the 5.009 < M^ +fx - < 5.169 GeV/c 2 and 
5.469 < M M + Al - < 5.969 GeV/c 2 sidebands. A fit to 
a straight line for the sum of sideband events over all 
eight vn bins, > 0.70, is shown in Fig. [T5] and is 
used to determined a fixed slope. For each bin the 
mass sidebands are fit to a straight line using the fixed 
slope but with a free floating normalization. The result- 
ing function is integrated over the relevant mass signal 
region to estimate the combinatorial background in that 
vn and M^^- bin. The fits in each NN bin are shown 
in Figures [IB] and [T7] for the CC and CF channels, re- 
spectively. This fixed-slope methodology significantly re- 
duces the uncertainty on the combinatorial background 
for the highest bins and is possible because 
and ispj are independent (cf. Sec. HVDTj) . We assign a 
6% systematic uncertainty associated with the statisti- 
cal uncertainty on the fixed slope. The statistical un- 
certainty from the normalization is also propagated into 
the background-estimate uncertainties and is a dominant 
contribution to the total uncertainty in the most sensitive 
NN bins. The final combinatorial background estimates 
are given in Tables I VII and IVIII 

Fully reconstructed B — > h + h'~ and partially recon- 
structed b — > /i + /i~ X decays can have kinematic proper- 
ties similar to B a s (B°) — > /-t + /i _ decays and thus can ob- 
tain large vn values. If these background processes con- 
tribute significantly to the sideband regions, they would 
invalidate the fixed-sloped methodology. The estimated 
B — > h + h'~ contribution to the sideband regions is less 
than 0.1 event in the CC and CF channels each. The 
partially reconstructed decays are constrained to have 
Ma+u- < M B o, and the lower edge of the lower side- 
band (5.009 GeV/c 2 ) is chosen to largely eliminate these 
events. To account for the possibility that these back- 
ground processes are affecting the combinatorial back- 
ground estimates in the highest i/jy bins, an additional 
systematic uncertainty is assessed for the three highest 
bins using alternative fits to the M^+^- sideband dis- 
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TABLE III: The NN efficiency for each vn bin for the CC and CF channeis after pr and isolation weighting. The uncertainties 
include only the statistical component. 



NN bin CC CF 

0.700 < v N < 0.760 2.2±0.1 2.3±0.1 

0.760 < v N < 0.850 4.1±0.1 4.4±0.1 

0.850 < v N < 0.900 2.9±0.1 3.4±0.1 

0.900 < u N < 0.940 4.5±0.1 4.7±0.1 

0.940 < u N < 0.970 8.3±0.1 6.2±0.1 

0.970 < v N < 0.987 10.9±0.1 10.2±0.1 

0.987 < u N < 0.995 12.5±0.1 8.5±0.1 

0.995 < v N < 1.000 46.1±0.3 46.8±0.3 



TABLE IV: Relative difference in NN bin efficiency between B + — > J/i/j K + data and MC. A positive (negative) difference 
indicates that the MC efficiency is higher (lower) than the data efficiency. The differences normalized to the associated statistical 
uncertainty are given in parenthesis. 



NN bin CC CF 

0.700 < v N < 0.760 -8.3% (-I.60-) -5.3% (-0.7a-) 

0.760 < v N < 0.850 -8.5% (-2.3<r) -7.9% (-1.4a) 

0.850 < v N < 0.900 4.0% (+0.9ct) -8.2% (-1.3a) 

0.900 < v N < 0.940 -0.5% (-O.lcr) 2.4% (+0.5<j) 

0.940 < v N < 0.970 0.1% (+0.1cr) -6.1% (-lAa) 

0.970 < v N < 0.987 2.9% (+l.lcr) 0.3% (+0.1<r) 

0.987 < v N < 0.995 4.4% (+2. la) -4.1% (-I.Oct) 

0.995 < v N < 1.000 3.4% (+2.6g) 7.0% (+3. la) 



tribution. An alternative straight-line fit is performed 
as described above except that all parameters of the fit 
are left floating. A second alternative fit is performed 
over the extended dimuon-mass sideband region, 4.669 < 
M M+/i - < 5.169 GcV/c 2 and 5.469 < M^ +A4 - < 5.969 
GeV/c 2 . By decreasing the lower edge of the lower side- 
band, a non-negligible number of partially reconstructed 
decays are admitted to the sideband sample in the highest 
vn bins and an exponential fit function is used. The al- 
ternative fits are shown in Figs. [TBI and IT91 for the CC and 
CF channels, respectively. For each alternative, the re- 
sulting combinatorial-background estimate is compared 
to the default estimates in Tables IVII and IVIII and the 
largest observed difference is assigned as an additional 
systematic uncertainty for each bin. In the highest vn 
bins, these systematic uncertainties are of the same mag- 
nitude as the statistical uncertainty on the normaliza- 
tion. The final systematic uncertainties range from ±19% 
(±3%) to ±43% (±42%) for the CC (CF) channel. 



B. Peaking backgrounds 



nificantly lower rate at which protons reach the muon de- 
tectors and satisfy the muon-identification requirements 
and due to smaller production cross-sections [351 ] . 

The B — > h + h'~ contribution to the dimuon-mass 
signal region is estimated using data to determine the 
pion- and kaon-misidentification rates and the methods 
of Sec. IV Al to determine the remaining acceptances and 
efficiencies. The resulting B —> h + h'~ background is 
about a factor of ten smaller than the combinatorial back- 
ground in the B® search, while comprising about half the 
total background in the B° search. 

The probability for pions and kaons to meet the muon 
identification requirements is extracted with a pure sam- 
ple of kaons and pions from D*-tagged D° — > K~tt + de- 
cays. These decays yield two same-sign pions, one from 
the D* + — > D°7r + decay and one from the subsequent D° 
decay, and an oppositely signed kaon. These charge cor- 
relations are used to identify the pions and kaons unam- 
biguously without needing to rely on any particle identi- 
fication criteria. The -D*-tagged event sample is collected 
using the first 7 fb" 1 of data with a trigger that requires 
two charged particles displaced from the beam line, each 
with pt > 2 GeV/c, that reconstruct to a secondary ver- 



Background from B — > h + h' decays must be esti- tex[4l|. We further require that the trigger particles have 



mated separately since they produce a peak in the M 
distribution and are not included in the combinatorial- 
background estimates described in Sec. IVI Al Decays in- 
volving B baryons, such as A& — > pir~ , are more heavily 
suppressed than the i?-meson background due to the sig- 



oppositc charge and satisfy the baseline tracking criteria 
described in Sec. lIV Al A kinematic fit is performed, con- 
straining the two tracks to a common vertex, that must 
satisfy x 2 /Adof < 15. The resulting D° K~ir + candi- 
date must have (77! < 1, an impact parameter less than 
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FIG. 15: Dimuon-mass distributions for vn > 0.7 for the CC and CF channels with extended signal-region blinded. The slopes 
from these fits are fixed and used to estimate the combinatorial background in each NN bin. 
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FIG. 16: Dimuon-mass distributions with fit overlaid for each of the eight NN bins for the CC channel with the extended 
signal-region blinded. The slope of each curve is taken from the fit in Fig. [15] while the normalization is determined in each bin 
separately. 
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TABLE V: The SM expected B° — > fi + /i signal contribution in each NN bin for the CC and CF channels separately. 



NN bin CC CF 

0.700 < v N < 0.760 0.04 ±0.01 0.03 ±0.01 

0.760 < v N < 0.850 0.07 ±0.01 0.05 ± 0.01 

0.850 < u N < 0.900 0.05 ±0.01 0.04 ±0.01 

0.900 < u N < 0.940 0.07 ±0.01 0.05 ± 0.01 

0.940 < v N < 0.970 0.10 ± 0.02 0.07 ±0.01 

0.970 < v N < 0.987 0.13 ± 0.02 0.11 ±0.02 

0.987 < u N < 0.995 0.20 ±0.04 0.09 ±0.02 

0.995 < v N < 1.000 0.75 ± 0.13 0.52 ± 0.10 
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FIG. 17: Dimuon-mass distributions with fit overlaid for each of the eight NN bins for the CF channel with the extended 
signal-region blinded. The slope of each curve is taken from the fit in Fig. [15] while the normalization is determined in each bin 
separately. 
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TABLE VI: Estimated number of combinatorial background events for the B® dimuon-mass signal-region for each (M„+„- , vn) 
bin and the associated statistical uncertainty. 

5.310-5.334 5.334-5.358 5.358-5.382 5.382-5.406 5.406-5.430 




NN bin 



0.700< 
0.760< 
0.850< 
0.900< 
0.940< 
0.970< 
0.987< 
0.995< 

0.700< 
0.760< 
0.850< 
0.900< 
0.940< 
0.970< 
0.987< 
0.995< 



v N <0. 
v N <0, 

VN <0, 

v N <0, 

V N <0 
V N <0. 
V N <0. 

CF 

vjv <0. 
i^jv <0. 

I<N <0 

^jv <0. 

2^iV <0. 
V N <0. 

<0. 
l/JV <1. 



760 
850 
900 
940 
970 
987 
995 
000 

700 
850 
900 
940 
070 
987 
005 
000 



10.42±0.72 
11.02±0.74 
4.69±0.46 
4.49±0.45 
3.85±0.41 
2.21±0.30 
0.92±0.19 
0.24±0.10 

10.18±0.72 
11.21±0.76 
6.11±0.54 
4.65±0.46 
3.94±0.42 
2.74±0.35 
0.83±0.19 
0.71±0.17 



10.33±0.71 
10.92±0.74 
4.65±0.45 
4.45±0.44 
3.81±0.41 
2.19±0.30 
0.91±0.19 
0.24±0.10 

10.05±0.71 
11.08±0.75 
6.03±0.53 
4.59±0.46 
3.90±0.42 
2.71±0.34 
0.82±0.18 
0.70±0.17 



10.23±0.70 
10.82±0.73 
4.61±0.45 
4.41±0.44 
3.78±0.40 
2.17±0.30 
0.91±0.19 
0.24±0.10 

9.93±0.70 
10.94±0.74 
5.96±0.52 
4.54±0.45 
3.85±0.41 
2.67±0.34 
0.81±0.18 
0.69±0.17 



10.14±0.70 
10.72±0.72 
4.56±0.44 
4.37±0.43 
3.74±0.40 
2.14±0.30 
0.90±0.19 
0.23±0.10 

9.80±0.69 
10.80±0.73 
5.88±0.52 
4.48±0.44 
3.80±0.41 
2.64±0.34 
0.80±0.18 
0.68±0.17 



10.04±0.69 
10.62±0.71 
4.52±0.44 
4.33±0.43 
3.71±0.39 
2.12±0.29 
0.89±0.19 
0.23±0.10 

9.68±0.68 
10.66±0.72 
5.81±0.51 
4.42±0.44 
3.75±0.40 
2.61±0.33 
0.79±0.18 
0.67±0.16 



TABLE VII: Estimated number of combinatorial background events for the B° dimuon-mass signal-region for each (M„+„- , vn) 
bin and the associated statistical uncertainty. 



_Mass bin (GeV/c 2 



NN bin 



5.219-5.243 5.243-5.267 5.267-5.291 5.291-5.315 5.315-5.339 



CC 






700< 


VN 


<() 


.760 





760< 


VN 


<0 


.850 





850< 


VN 


<0 


.900 





900< 


VN 


<() 


.940 





940 < 


VN 


<0 


.970 





970< 


VN 


<0 


.987 





987< 


VN 


<0 


.995 





995< 


VN 


<1 


.000 






CF 









700< 


VN 


<0 


.760 





760< 


VN 


<() 


.850 





850< 


VN 


<0 


.900 





900< 


VN 


<0 


.940 





940 < 


VN 


<() 


.970 





970< 


VN 


<0 


.987 





987< 


VN 


<() 


.995 





995< 


VN 


<1 


.000 



10.78±0.74 
11.41±0.77 
4.85±0.47 
4.64±0.46 
3.98±0.42 
2.28±0.32 
0.95±0.20 
0.25±0.10 

10.65±0.75 
11.73±0.80 
6.39±0.56 
4.87±0.48 
4.13±0.44 
2.87±0.36 
0.87±0.20 
0.74±0.18 



10.69±0.74 
11.30±0.76 
4.81±0.47 
4.60±0.46 
3.95±0.42 
2.26±0.31 
0.95±0.20 
0.25±0.10 

10.52±0.74 
11.60±0.79 
6.31±0.55 
4.81±0.48 
4.08±0.44 
2.83±0.36 
0.86±0.19 
0.73±0.18 



10.59±0.73 
11.21±0.76 
4.77±0.46 
4.56±0.45 
3.91±0.42 
2.24±0.31 
0.94±0.20 
0.24±0.10 

10.40±0.73 
11.46±0.78 
6.24±0.55 
4.75±0.47 
4.03±0.43 
2.80±0.35 
0.85±0.19 
0.72±0.18 



10.50±0.72 
11.10±0.75 
4.72±0.46 
4.52±0.45 
3.88±0.41 
2.22±0.31 
0.93±0.20 
0.24±0.10 

10.27±0.72 
11.32±0.77 
6.16±0.54 
4.70±0.47 
3.98±0.43 
2.77±0.35 
0.84±0.19 
0.71±0.17 



10.40±0.72 
11.00±0.74 
4.68±0.46 
4.48±0.44 
3.84±0.41 
2.20±0.30 
0.92±0.19 
0.24±0.10 

10.15±0.72 
11.18±0.76 
6.09±0.53 
4.64±0.46 
3.94±0.42 
2.73±0.35 
0.83±0.19 
0.70±0.17 



100 /xm, and 1.77 < M Kw < 1-97 GcV/c 2 . For surviving 
events we associate a third track, the soft pion from the 
D*+ -> D°tt decay, with p T > 0.4 GeV/c, |z | < 1.5 cm, 
and | do | < 600 /zm and form a three-track vertex that 
must satisfy 144 < M K7T7r - M Kn < 147 McV/c 2 . The 
resulting sample of kaons and pions from the D° decay 
has a purity of > 99% and is used to estimate the effi- 
ciency of the muon-identification requirements for kaons 
and pions, or fake rates. 

The pion (kaon) fake rate is determined as the ratio 
of the number of D° candidates with a pion (kaon) track 
that satisfies the muon identification criteria of Sec. lIV Al 
to the total number of D° candidates. The fake rates 
for 7r + , 7T _ , K + , and K~ are measured as a function of 
instantaneous luminosity and px for central and forward 
nations separately. The 7r + and 7r~ rates are found to be 
consistent and are combined to yield an average ir^ fake 
rate. The number of events in which one track meets the 



muon selection criteria is estimated by fitting the Mk-k 
mass distribution to a function that is the sum of a Gaus- 
sian distribution and first-order polynomial. The number 
of events where both tracks fail the muon-sclcction cri- 
teria is determined by fitting the Mk-k distribution to 
the sum of two Gaussian distributions and a first-order 
polynomial. Figures [20] and [5T] show the Mktt distribu- 
tions with fits overlaid for a lower and higher kaon pt 
bin for the central-muon-detector system combining all 
luminosity bins. 

Time-dependent changes in fake rates can occur due 
to changes in the accelerator performance affecting the 
instantaneous luminosity or due to differences in detec- 
tor performance associated with aging or changes in the 
operational configuration. The instantaneous luminosity 
was found to be the primary source of the fake-rate time 
dependence. We perform a consistency check to inves- 
tigate other sources. The fake rates, binned in pt and 
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FIG. 18: Dimuon mass-sideband distributions with alterna- 
tive fits overlaid for the three highest NN bins in the CC 
channel with the extended signal region blinded. 



FIG. 19: Dimuon mass-sideband distributions with alterna- 
tive fits overlaid for the three highest NN bins in the CF 
channel with the extended signal region blinded. 



instantaneous luminosity, are applied to the D*-tagged 
sample as weights. The sum of weights is then compared 
to the actual number of fakes in bins of calendar date re- 
sulting in differences of up to 20%. This 20% difference 
is assigned as a systematic uncertainty and accounts for 
the largest contribution to the fake-rate uncertainty. In 
the final determination of the total B — > h + h!~ contri- 
bution a weighted average of fake rates is used, based on 
the instantaneous-luminosity profile of the dimuon-mass- 
sideband events. The luminosity-averaged fake rates are 
shown in Fig. [2U 

The expected number of peaking background events 
for a specific two-body hadronic decay is given by 

Ei, f xcco 

M — TP — lb 

where F s b is the ratio of relevant fragmentation fractions 
depending on which signal and background channels are 
being evaluated (i.e., B° -> h + h'~ or B° -> h + h'~) 
and on which signal-search mass- window is selected (i.e., 
Bg —> fx + \i~ or B° — s- For B° hadronic back- 

grounds in the B° search and for B® hadronic back- 
grounds in the B® search this factor equals unity. For 
B° hadronic backgrounds in the B® search region F s b = 
fd/fs, while for hadronic backgrounds in the B° 
search F 8 f, = f s /fd- The value for fd/f s is taken 
from Ref. (3(| and has a 13% uncertainty. The value 
of the branching fraction B b for a specific background 
mode and its associated uncertainty are also taken from 
Ref. (36|. For unobserved B — > h + h'~ processes, the 



current branching-fraction upper-limits are used and a 
100% uncertainty is assigned. The S s term corresponds 
to the single-event-sensitivity for B® — > as defined 

in Sec. IVl and is taken from Eq. ([T]). The last term cor- 
rects S s for the differing reconstruction efficiencies be- 
tween B° -t n + y~ (e^S ) and the relevant B -> h + h'~ 
decay (e T b eco )- In particular, the muon-stub reconstruc- 
tion and identification efficiencies discussed in Secs. lVC~2l 
and IV C 31 are replaced by the relevant double-track pion 
and kaon fake-rate estimates. The double-track fake rate 
is estimated from the convolution of the single-track fake 
rates with the (pj. , p^. ) spectra obtained from a sample 
of B —> h + h'~ MC events meeting the baseline criteria 
and using the instantaneous-luminosity distribution ob- 
served in the sideband events. A systematic un- 
certainty of 35% is assigned based on the 20% single-track 
uncertainty. The efficiency for the reconstructed invari- 
ant mass falling into a given M„+„- bin is corrected for 
differences between B® — > and B h + h'~ decays 
using simulated B — > h + h'~ events surviving the base- 
line criteria weighted by their double-track fake rates. 
All other relevant efficiencies are found to be consistent 
between B® — > and B — > h + h'~ decays. The final 
peaking background estimates are given in Tables I Villi 
and US 

The vn distribution is assumed to be the same for both 
B —> h + h!~ and B® —> since they feature similar 

kinematic properties and the z/jv does not use any muon- 
identification criteria as input variables. We verify that 
the pt dependence of the fake rates negligibly affects the 
i/jv distribution. 
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TABLE VIII: The B — > h + h'~ background estimates and their total uncertainty for the B® signal window for each (M M + M - , vn) 
bin. The contributions are negligibly small in all the lower vn bins. Uncertainties less than 0.001 are given as 0.000 in the 
table. 



/ / 5.310-5.334 5.334-5.358 5.358-5.382 5.382-5.406 5.406-5.430 
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TABLE IX: The B — > h + h' background estimates and their total uncertainty for the B signal window for each (M„+„- , vn) 
bin. Uncertainties less than 0.001 are given as 0.000 in the table. 
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C. Background estimate checks with control 
samples 

The methods used to predict the background rates 
are validated using statistically-independent background- 
dominated data samples. The control samples are de- 
signed to reproduce the salient features of the combi- 
natorial and B — > h + h'~ backgrounds. Given a signal 
sample that consists of two opposite-charge muons with 
A > 0, we form four independent control samples, 

OS—: opposite-sign muon pairs, passing the baseline re- 
quirements with A < 0; 

SS+: same-sign muon pairs with A > and relaxed 
trigger-matching to improve event sample size; 

SS — : same-sign muon pairs with A < and relaxed 
trigger-matching to improve event sample size; 



FM+: opposite-sign fake-muon-enhanccd pairs, in which 
at least one track of which is required to fail the 
muon likelihood or dE/dx requirement with A > 0. 



The OS— sample is representative of combinatorial 
backgrounds with short lifetime, which have a symmetric 
lifetime distribution around zero. The same-sign samples 
are dominated by events in which a muon from a scmilcp- 
tonic decay of a B hadron is combined with a muon from 
the sequential semileptonic decay b — > cX — > /iX of the 
other B meson in the event and by events in which muons 
are combined from non-sequential processes. The FM+ 
sample is enriched in B — > h + h'~ background due to 
the reversal of the muon-identification requirements. To 
mimic the Af2 and A distributions of the A > samples, 
we apply the transformations A — > — A and AO — > n — Ail 
to the A < samples. 
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FIG. 20: The Mk, distributions with fits overlaid for central 
kaons with 2.0 < pr < 2.8 GeV/c. The top panels contain 
the distributions for D° candidates with kaons that fail the 
muon requirements, while the bottom panels contain the dis- 
tributions for D candidates with kaons that meet the muon 
ID requirements. Distributions for K~ {K + ) are on the left 
(right). 
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FIG. 22: Fake rates as a function of pr and averaged over 
instantaneous luminosity for central (top) and forward (bot- 
tom) muon detectors. The left panels show the fake rates for 
pions while the right panels show the fake rate for kaons. 



The background contribution to each control sample 
is estimated using the same methods as described in 
Sec. rvTAl and IVTBl For the OS- and SS± samples, 
only the combinatorial background is estimated due to 
the dominance of this component over the peaking back- 
ground. For the FM+ sample, we estimate both the com- 
binatorial and the B — s- h + h'~ background. New fake- 
rates are evaluated using the relaxed muon-identification 
criteria and the method described in Sec. IVI Bl In all 
cases the backgrounds are evaluated for the extended 
Mfj,+ti- signal region for each bin and then com- 
pared to the observed number of events. The Poisson 
probability for making an observation at least as large 
as found in the extended signal region, given the pre- 
dicted background and its systematic uncertainty, is cal- 
culated. These probabilities are expected to be uniformly 
distributed between and 100% for a set of independent 
samples. The resulting comparisons for all bins and 
control samples are shown in Table [X] 

No large deviation between the predicted number of 
background events and the observed number of events is 
seen, providing confidence in the methods employed to 
estimate the background rates. 



FIG. 21: The Mk-k distributions with fits overlaid for central 
kaons with 6.0 < pr < 8.0 GeV/c. The top panels contain 
the distributions for D° candidates with kaons that fail the 
muon requirements, while the bottom panels contain the dis- 
tributions for D candidates with kaons that meet the muon 
ID requirements. Distributions for K~ (K + ) are on the left 
(right). 



VII. SYSTEMATIC UNCERTAINTIES 

The systematic uncertainties are either related to un- 
certainties in efficiencies, acceptances, normalization fac- 
tors, pcaking-background estimates, or combinatorial- 
background estimates. Table IXJl summarizes all system- 
atic uncertainties. 

The dominant systematic uncertainty among the ef- 
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TABLE X: A comparison of the predicted (Pred.) and observed (Obs.) number of events in the extended M M + M - signal region 
as a function of vm bin for the various control samples. The uncertainties correspond to the uncertainty on the mean of the 
background prediction. The Poisson probability (Prob.) for making an observation at least as large as the observed yield is 
also shown. In cases where no events are observed, the probability is actually the Poisson probability to observe zero events 
assuming a Poisson mean equal to the predicted mean. 



Sample 






NN bin 






Pred. 


CC 
Obs. 


Prob.(%) 


Pred. 


CF 

Obs. Prob.(%) 







.700 




VjSl 







,760 268.8±14.3 


249 


82.3 


261.8±13.9 






OS- 





.760 




Vjm 







850 320.8±16.1 


282 


95.1 


399.0±18.5 


3Q7 


^ 9 







.850 


< 


V N 


< 





900 


i c;n Q_|_q q 


156 


36.5 


193.5±11.4 


185 


68.3 







.900 


< 


V N 


< 





94D 


1 Af\ 9+q 7 


158 


23.0 


177.4±10.8 


183 


37.7 







.940 


< 




< 


0. 


970 


1 Af\ 9+q 7 


137 


72.9 


156.8±10.1 


1 A ^ 
14o 


SI 9 
Ol.Z 







.970 






< 





987 


1 no A-\-'J Q 


98 


58.3 


112.6±8.2 




00.0 







.987 











995 


7«3 Q_|_fi Q 


59 


97.0 


53.3±5.4 


Do 


0.0 







.995 


< 


UN 


< 


1 


000 


41.2±4.8 


42 


47.2 


48.2±5.1 


1 1 


70 
* U.U 







.700 




VN 







.760 


4.8±1.2 


3 


81.8 


0.9±0.5 





8 Q 
o.y 


SS+ 





.760 




VN 







,850 


3.6±1.0 


5 


30.6 


5.1±1.2 





^ A 
00.4 







.850 


< 


VN 


< 





900 


2.4±0.8 


5 


12.2 


0.9±0.5 


6 


0.2 







.900 


< 


v N 


< 





940 


1.5±0.7 


3 


21.3 


0.9±0.5 


1 


56.8 







.940 


< 


vm 


< 





970 


1 5+0 7 

. 0-i—\J . i 


1 


73.3 


0.9±0.5 


1 


OO.O 







.970 




vn 







987 




2 


51.3 


0.9±0.5 


n 
u 


AC\ 7 
4U. / 







.987 


< 




< 





995 


3 





74.1 


0.3±0.3 


U 


7/1 1 
(4.1 
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< 


vn 


< 


1 


000 


3 





74.1 


0.3±0.3 


1 
1 


^0 
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Vn 


»• 





,760 


7.8±1.5 


10 


27.8 


6.0±1.3 


A 
4 


SO Q 


ss- 





.760 




Vn 


<: 





,850 


10.5±1.8 


11 


47.2 
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7 


00.0 
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< 


vn 
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4.2±1.1 


7 


15.9 


3.0±0.9 


2 


75.8 







.900 


< 


v n 


< 





,940 


3.6±1.0 


4 


47.2 


0.9±0.5 


7 


0.1 







.940 


< 


v n 


< 





970 


3.3±1.0 


6 


14.3 


3.6±1.0 


2 


83.4 







.970 


< 


VN 


< 





,987 


3.0±0.9 


3 


55.0 


2.4±0.8 


5 


12.2 







.987 


< 


v N 


< 





,995 


2.1±0.8 





12.2 


1.2±0.6 





30.1 







.995 


< 


v N 


< 


1 


000 


1.2±0.6 


1 


65.9 


1.8±0.7 





16.5 







.700 


< 


v N 


< 





,760 


152.2±9.9 


161 


29.6 


66.5±6.1 


88 


2.5 


FM+ 





.760 


< 


v N 


< 





,850 


140.9±9.5 


157 


15.3 


81.7±6.9 


70 


70.0 







.850 


< 


v N 


< 





,900 


65.2±6.1 


50 


94.4 


44.7±5.0 


34 


91.6 







.900 


< 


VN 


< 





,940 


48.7±5.2 


40 


85.8 


24.4±3.6 


38 


2.3 







.940 


< 


VN 


< 





970 


27.7±3.8 


24 


73.1 


12.7±2.6 


20 


7.1 







.970 


< 


VN 


< 





,987 


10.9±2.3 


12 


41.4 


7.7±2.0 


13 


8.8 







.987 


< 


VN 


< 





,995 


11.0±2.3 


4 


98.3 


2.7±1.1 


3 


48.3 







.995 


< 


V N 


< 


1 


,000 


28.3±4.1 


32 


30.6 


4.4±1.6 


8 


13.0 



ficiencies, acceptances, and normalization factors is the 
13% uncertainty on the ratio of 6-quark fragmentation 
fractions, f u /f s [36|. The second largest systematic un- 
certainty is about a factor of two smaller and is due to 
the acceptance ratio (Sec. IV A"| . 

An additional systematic uncertainty, the B a s {B Q ) 
mass-shape uncertainty, is assigned based on the prob- 
ability for a B®(B°) candidate to populate the dimuon- 
mass signal-search window. This uncertainty is based 
on the world average B®(B° )-mass uncertainty and the 
COT momentum scale and resolution. The final uncer- 
tainty ranges between 1% and 9% depending on M Al + |U - 



bin. 

The leading systematic uncertainty for the peaking 
background is the 35% uncertainty assigned to the 
double-track fake rate as discussed in Sec. IVIBI In addi- 
tion, an uncertainty associated with the branching frac- 
tion of each B — > h + h'~ decay is taken from Ref. [36| . 
Branching fractions for which only upper limits are 
known are assigned a 100% relative uncertainty. These 
decays, however, contribute a small fraction to the to- 
tal B h + h'~ background. The leading contribution 
to the B —> h + h'~ background in the B° signal-search 
mass- window comes from B° — > K + it~, B° — > 7r + 7r~, 
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and B® — > K + K~ . The branching fractions of these 
decays have relative uncertainties of 3%, 4%, and 16%, 
respectively. The B — > h + h'~ background estimates 
that require /d// s are treated as correlated with the 
B° —> normalization (cf. Eq. ([T])). 

The leading systematic uncertainty, up to 43%, in 
the combinatorial background estimates is due to the 
dimuon-mass-sideband shape-uncertainty assigned to the 
three highest vn bins. The dimuon-mass-sideband shape- 
uncertainty is discussed in IVI A[ Another large source 
of systematic uncertainty, up to 42% in the highest vn 
bins, is due to finite sample-size in the M M + M - sidebands. 
A relatively small contribution (sa 6%) to the total- 
combinatorial-background systematic uncertainty arises 
from the uncertainty on the fixed slope used in the esti- 
mates. 

In the statistical interpretation of results all the above 
systematic uncertainties are taken as nuisance param- 
eters with multidimensional Gaussian constraints that 
include correlations. Combinatorial background yields 
across all (-M„+„- , vm) bins in the CC or CF channel 
are correlated because they are fit using the same slope. 
Combinatorial background estimates in a given bin 
are additionally correlated across the five mass bins be- 
cause their normalization is determined from the same 
set of M„+„- sideband events. Peaking backgrounds are 
treated as correlated across all bins and across the CC 
and CF channels, due to the use of common fake-rates. 
All the acceptances and efficiencies of Eq. ([1) are treated 
as correlated across i>n and „- bins and across the 
CC and CF channels due to the use of common MC and 
data control samples. 



VIII. ANALYSIS OPTIMIZATION 

We optimize the vn and M /J + M - binning by minimizing 
the expected 95% C.L. upper limit on B{B° S — > 
assuming only background. The expected limit is calcu- 
lated using a modified frcqucntist methodology, the CLs 
method [421 ], combining all the vn and M M + M - bins in 
both the CC and CF channels, while taking correlations 
between bins into account. 

For events surviving all selection criteria, two likeli- 
hood fits are performed, one assuming a background-only 
hypothesis, with the likelihood £(&), and one assuming a 
signal-plus-background hypothesis, with likelihood £(s + 
b). A log-likelihood ratio LLR = -2 In (£(s + b)/£(b)) 
is calculated. The likelihoods are defined as the prod- 
uct of Poisson probabilities over all (M„+u- , ^iv) bins 
in both the CC and CF channels. For each bin, the 
mean number of expected events is estimated assuming 
only background for £(&) and assuming signal plus back- 
ground for C(s + b), while the number of observed events 
is taken from the number of surviving events falling into 
that bin. Systematic uncertainties are treated as nui- 
sance parameters with Gaussian constraints. At a fixed 
B(B® — > the likelihoods are minimized by vary- 



ing the nuisance parameters. We denote the minimum of 
the log-likelihood ratio as LLR min . 

Expected limits are calculated using an ensemble 
of background-only simulated data sets corresponding 
in size to the actual data set used in this analysis. 
The number of contributing background events in each 
(M^+^j- , i/jv) bin is drawn from a Poisson distribution 
whose mean corresponds to the values in Table I VII for 
the combinatorial background and in Table IVIIII for the 
peaking background. The mean values shown in the ta- 
bles are varied by their systematic uncertainties taking 
into account correlations between bins. Once assembled, 
each simulated data set is treated just like the exper- 
imental data. The median CLs as a function of as- 
sumed B(B® — > fi + fi~) is used to determine the expected 
95% confidence level (C.L.) upper limit. For alternative 
choices of the (M M +^- , vn) bins, the methods of Sec. IVI Al 
and Sec. IVI Bl are used to generate a mean background- 
expectation for each bin and to update the systematic 
uncertainties, while the methods of Sec. IV Al are used to 
generate the corresponding signal acceptance. 

The optimization process is iterative. While fixing the 
M„+„- bins, it begins with many bins and we then 
combine neighboring bins with similar expected signal- 
to-background ratios. The bin boundaries of the result- 
ing eight vn bins are then varied to minimize the ex- 
pected B{B° S — > limit. Finally the M^+ M - bins 
are varied. The resulting expected B{B® — > limit 
is not significantly dependent on the exact choice of bin 
boundaries and varies by less than 5% over reasonable 
variations of the bin definitions. The final configuration 
results in eight bins and five M^+^- bins. The vn 
bins are 0.700 < v N < 0.760, 0.760 < v N < 0.850, 
0.850 < v N < 0.900, 0.900 < v N < 0.940, 0.940 < 
v N < 0.970, 0.970 < u N < 0.987, 0.987 < v N < 0.995, 
and 0.995 < vm < 1.000. The five mass bins for the 
B° s (B°) search are 5.310 < M M+M - < 5.334, 5.334 < 
< 5.358, 5.358 < M^- < 5.382, 5.382 < 
AV M - < 5.406, and 5.406 < M M+M - < 5.430 GcV/c 2 
(5.219 < M M+M - < 5.243, 5.243 < M^- < 5.267, 
5.267 < M M+A4 - < 5.291, 5.291 < M M+/1 - < 5.315, and 
5.315 < Mp+p- < 5.339 GeV/c 2 ). This optimization 
reduces the expected limit by approximately 20% com- 
pared to using a single bin with i>n > 0.7. The final 
expected-upper-limit for the B® (B ) search is 1.3 x 10 -8 
(4.2 x 10~ 9 ) at 95% C.L. and 1.0 x 10~ 8 (3.4 x 10~ 9 ) at 
90% C.L. 



IX. RESULTS 

The background estimates, systematic uncertainties on 
the background estimates, and the observed number of 
events for the B a —> search are given in Table IXlTl 

and summarized in Fig. [531 An ensemble of simulated 
experiments assuming the background-only hypothesis 
and including the effects of systematic uncertainties is 
used to estimate the probability that backgrounds alone 
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TABLE XI: Summary of systematic uncertainties. 



Category 


Quantity 


CC (%) CF (%) 


Source 






6 


7 


b mass, renorm. scale, fragmentation modeling 




e£? T /e£° T 

B+ ' B° 


1 


1 


Isolation, p r p{B^) : detector effects 


Efficiencies, 
acceptance, and 
normalization 


pid /,Pib 

B+ / B° 
C SVXII /J3VXII 
e B+ l e B° 


3 
3 
3 


3 
3 
3 


— y jx~ and J 1 'ip — y fi^ fi~ differences 
B^ — > J/ip and J/ip — ► fx fM~ differences 
Pt(m)j two-muon opening-angle, track isolation 


factors 


,COT 
e K 


1.7 


1.7 


Isolation, pr(B), detector effects 






4 


4 


B isolation, pt{B) 




e NN for i/jv > 0.995 


3.4 


7.0 


Data-MC differences 




/«/ fs 


13 


13 


Ref. [36] 




B{B+ -> J/V) -> f J L + fj,-K+) 


4 


4 


Ref. [36] 




B-meson mass shape 


0.1-9 


0.1-9 


Mass resolution, mass scale, Ref. [36] 


B -> h+h'- 


Fake rates 


20 


20 


Detector and luminosity effects 


B{B^h+h'-) 


3-100 


3-100 


Ref. [36] 




Slope 


6 


6 


Fit uncertainty 



Comb. Bkg 



Normalization 
Shape 



7-42 7-25 Sideband sample-size 

10-43 3-42 Comparison of different fit functions 



could produce a LLR min value at least as small as the 
one observed in the data. The resulting p-value for the 
B° f.i + fJ,~ search is 41%, indicating that the observed 
events are consistent with the background expectations. 
The observed upper limits are shown in Fig. [53] and 
are obtained using the CLs method to give 4.6 x 10~ 9 
(3.8 x 10- 9 ) at 95% (90%) C.L. 

The results of the B® — > search are shown in 

Table IXIIII and summarized in Fig. [23l A small excess 
of events is observed in the CC channel and populates 
the most sensitive vn and M„+„- bins. The probability 
that background alone could yield a LLR m j n value smaller 
than that observed is 0.94%, which corresponds to an ex- 
cess greater than 2a. Using the expression A\ 2 = LLR — 
LLR m j n we perform a fit to estimate B(B® —> 
The fit, shown in Fig. [23 includes systematic uncertain- 
ties and gives B{B° -> fJ, + /JT) = l-3t° 9 7 x 10~ 8 at 68% 
C.L. and 0.8 x 10~ 9 < B(B° -> /i+/J~) < 3.4 x 10~ 8 at 
95% C.L. A Baycsian method yields very similar results. 
The probability that the SM, including signal, could yield 
a value of LLR m j n smaller than that observed in the data 
is 6.8% as determined using an ensemble of simulated ex- 
periments that include signal and background contribu- 
tions, assuming the SM value for B(B® and 
including the effects of systematic uncertainties. The 
observed upper limits from the CLs methodology are 
3.1 x 10~ 8 (2.7 x 10~ 8 ) at 95% (90%) C.L. and are shown 
in Fig. [551 Relative to the previous analysis [l8| , which 



also reported a small excess using 30% less data, the sig- 
nificance of the excess has diminished and the estimate 
of B(B° -» is closer to the SM value. 

An excess is observed in the two highest NN bins 
of the CC channel, the most sensitive bins. The total 
background expectations for the 0.987 < vn < 0.995 
and 0.995 < v N < 1.000 bins are 4.56 and 1.29 events 
while the SM expected signal yields are 0.75 and 0.20 
events, respectively. We observe a total of 6 and 4 
events, respectively, for these bins. As a check of con- 
sistency we redo the B(B® — > /i + /j~) fit using only the 
two highest NN bins. This yields a central value of 
B(B° S -> fi+fJ,-) = l-Oto.l x 10 ~ 8 > consistent with the 
full fit. When considering only the two highest NN bins, 
the probability that background-only (background plus 
SM signal) could yield a log-likelihood ratio smaller than 
that observed in data is 2% (22%) including the effects 
of systematic uncertainties. 

We also observe a data excess in the 0.970 < vm < 
0.987 bin of the CC channel in the B® —> search, 
where no significant signal contribution is expected. Note 
that such an excess does not appear in the B° — > 
search. This excess originated in the previous _B? — > 
ji + pr analysis and was thoroughly investigated ^M,. It 
was concluded that the excess in this bin was not caused 
by a problem with the background estimates, a NN bias, 
or any mis-modeling of the data and was likely due to 
a statistical upward fluctuation. This conclusion is sup- 
ported by Fig. [23 which compares the observed data to 
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TABLE XII: The results for the B° — > fi + fi~ search comparing the expected total backgrounds and their uncertainty (Exp.) 
to the number of observed events (Obs.) in each (fjv,Af M + M _) bin. For each vm bin, the sum over mass bins is also shown. 
The CC and CF channels are given separately. 



CC channel results: 

Mfi+fj,- bin (GeV/c^) 





V N 


bin 




5.310-5.334 


5.334-5.358 


5.358-5.382 


5.382-5.406 


5.406-5.430 


Sum 










Exp. 


10.80±0.74 


1 (\ T("1 1 f\ <~f A 

10.70±0.74 


10.61±0.73 


10.51±0.72 


10.41±0.72 


53.02 





.700- 





.760 


Obs. 


15 


14 


10 


7 


11 


57 










Exp. 


11.43±0.77 


11.33±0.76 


11.23±0.75 


11.12±0.75 


11.01±0.74 


56.12 





.760- 





.850 


Obs. 


12 


10 


7 


8 


9 


46 










Exp. 


4.88±0.47 


4.83±0.47 


4.78±0.46 


4.73±0.46 


4.68±0.46 


23.90 





.850- 





.900 


Obs. 


10 


3 


6 


6 


5 


30 










Exp. 


4.68±0.46 


4.63±0.46 


4.59±0.45 


4.54±0.45 


4.49±0.44 


22.92 





.900- 





.940 


Obs. 


6 


10 


6 


8 


6 


36 










Exp. 


4.03±0.42 


3.99±0.42 


3.94±0.42 


3.89±0.41 


3.85±0.41 


19.70 





.940- 





.970 


Obs. 


2 


3 


4 


4 


5 


18 










Exp. 


2.34±0.39 


2.31±0.39 


2.28±0.38 


2.24±0.38 


2.21±0.38 


11.39 





.970- 





.987 


Obs. 


2 


2 


3 


1 


3 


11 










Exp. 


1.04±0.27 


1.03±0.27 


1.00±0.27 


0.96±0.26 


0.93±0.26 


4.96 





.987- 





.995 


Obs. 


4 


2 


2 


1 


1 


10 










Exp. 


0.57±0.19 


0.54±0.17 


0.47±0.16 


0.37±0.15 


0.29±0.14 


2.24 





.995- 


1 


.000 


Obs. 


2 


1 


1 





1 


5 



CF channel results: 

M M+M - bin (GeV/c^) 





V N 


bin 




5.310-5.334 


5.334-5.358 


5.358-5.382 


5.382-5.406 


5.406-5.430 


Sum 










Exp. 


10.65±0.75 


10.53±0.74 


10.40±0.73 


10.28±0.73 


10.15±0.72 


52.01 





.700- 





.760 


Obs. 


8 


13 


12 


16 


10 


59 










Exp. 


11.74±0.80 


11.61±0.79 


11.47±0.78 


11.33±0.77 


11.19±0.76 


57.33 





.760- 





.850 


Obs. 


9 


13 


13 


13 


12 


60 










Exp. 


6.40±0.56 


6.32±0.55 


6.24±0.55 


6.17±0.54 


6.09±0.53 


31.22 





.850- 





.900 


Obs. 


3 


4 


3 


2 


1 


13 










Exp. 


4.88±0.48 


4.82±0.48 


4.76±0.47 


4.70±0.47 


4.64±0.46 


23.80 





.900- 





.940 


Obs. 


3 


8 


7 


8 


5 


31 










Exp. 


4.14±0.44 


4.09±0.44 


4.04±0.43 


3.99±0.43 


3.94±0.42 


20.20 





.940- 





.970 


Obs. 


5 


7 


2 


1 


2 


17 










Exp. 


2.89±0.46 


2.85±0.46 


2.82±0.45 


2.78±0.45 


2.74±0.44 


14.07 





.970- 





.987 


Obs. 


2 


1 


3 


1 


4 


11 










Exp. 


0.88±0.26 


0.87±0.25 


0.86±0.25 


0.85±0.25 


0.83±0.24 


4.30 





.987- 





.995 


Obs. 


4 





1 





1 


6 










Exp. 


0.82±0.37 


0.81±0.36 


0.79±0.36 


0.75±0.35 


0.72±0.35 


3.89 





.995- 


1 


.000 


Obs. 


1 











1 


2 



the background expectations for the B® — » search 
for the 3 fb _1 of data added for this analysis. No evidence 
of an excess in the 0.970 < < 0.987 bin is found in 
the data added since the analysis of Ref. [lU . 

Our B° — > and B® — > /i + /J~ results are consis- 

tent with the bounds set by other experiments and with 
the SM expectations. This is demonstrated in Fig. [28] 
for the B® — > result, where the small correc- 

tions (< 10%) suggested by the recent work in Refs. [43j 
and [4J1 have not been considered. 



X. CONCLUSION 

We report on the search for B® — s- fi + fJ,~ and B° —> 
decays using the full CDF Run II data set. These 
are the most sensitive searches for these decays at the 
Tevatron. For the B° — > search, the observed 

data are in agreement with background-only expecta- 
tions and an upper limit of B(B° — > yU + ^~) < 4.6 x 10 -9 
(3.8 x 10- 9 ) at 95% (90%) C.L. is set. For the B° s 

search, a small excess of events is observed relative 
to expectations from background-only sources with a p- 
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FIG. 23: The background estimate (light gray) and its systematic uncertainty (hashed area) is compared to the data (points), 
and their Poisson uncertainties (error bars on points) for the CC and CF channels for the B° — > fi + fi~ (top) and B® — > 
(bottom) searches. Expectations that include signal at 4.1 times the SM rate (dark gray), corresponding to the fitted value 
from Fig. 1251 are also shown for the B® results. The lowest five NN bins have been combined because the signal sensitivity is 
concentrated in the highest three NN bins. 



value of 0.94% (6.8%) assuming only background (back- 
ground plus SM signal). Using a fit to the data we mea- 
sure B(B° S -> fj, + fj,~) = l-3±o'.7 x 10_8 and the following 
bounds arc set, 2.2 x 10" 9 < B{B° S -> < 3.0x 10~ 8 

and 0.8 x 10~ 9 < B{B° S -> m + aO < 3.4 x 10" 8 at 90% 
and 95% C.L., respectively. These measurements are con- 
sistent with our previous result, the recent results from 
other experiments, and the SM expectations. 

These are the final CDF results for searches for these 
rare FCNC decays and are the culmination of a pro- 
gram spanning nearly two decades. The sensitivity of 
the -B 9 — > analysis reported here is better than 

the pioneering measurement by CDF [45[ by a factor of 



over 800, which exceeds by a factor of 35 the gain in sen- 
sitivity expected by just increasing the sample size. The 
gains in search sensitivity originated from continual im- 
provements to the analysis techniques employed. Those 
techniques are described in detail to afford future exper- 
iments performing similar searches or measurements the 
opportunity to benefit from this research. These results 
form the most sensitive search for B®(B°) — > de- 
cays performed previous to the LHC operational period 
and remain competitive with the most recent LHC re- 
sults. The B® and B° results from all experiments are 
compatible with one another, indicate that there is no 
strong enhancement to the B® — > decay rate, and 
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TABLE XIII: The results for the B° — > fj. + fJ,~ search comparing the expected total backgrounds and their uncertainty (Exp.) 
to the number of observed events (Obs.) in each {vn, ) bin. For each vm bin, the sum over mass bins is also shown. 

The CC and CF channels are given separately. 



CC channel results: 

Mfi+fj,- bin (GeV/c^) 





V N 


bin 




5.310-5.334 


5.334-5.358 


5.358-5.382 


5.382-5.406 


5.406-5.430 


Sum 










Exp. 


10.43±0.72 


10.33±0.71 


10.23±0.71 


10.14±0.70 


1 A A < 1 f \ f l\ 

10.04±0.69 


51.17 





.700- 





.760 


Obs. 


13 


8 


7 


6 


7 


41 










Exp. 


11.03±0.74 


10.93±0.74 


10.82±0.73 


10.72±0.72 


10.62±0.72 


54.13 





.760- 





.850 


Obs. 


9 


8 


12 


15 


8 


52 










Exp. 


4.70±0.46 


4.65±0.45 


4.61±0.45 


4.56±0.44 


4.52±0.44 


23.03 





.850- 





.900 


Obs. 


6 


8 


5 


6 


5 


30 










Exp. 


4.50±0.45 


4.45±0.44 


4.41±0.44 


4.37±0.43 


4.33±0.43 


22.05 





.900- 





.940 


Obs. 


5 


5 


5 


6 


8 


29 










Exp. 


3.86±0.41 


3.82±0.41 


3.78±0.40 


3.74±0.40 


3.71±0.39 


18.91 





.940- 





.970 


Obs. 


5 


7 


2 


3 


4 


21 










Exp. 


2.22±0.38 


2.19±0.37 


2.17±0.37 


2.15±0.37 


2.12±0.36 


10.84 





.970- 





.987 


Obs. 


1 


4 


8 


2 


3 


18 










Exp. 


0.94±0.26 


0.92±0.26 


0.91±0.26 


0.90±0.25 


0.89±0.25 


4.56 





.987- 





.995 


Obs. 


1 


1 


3 


1 





6 










Exp. 


0.31±0.14 


0.26±0.14 


0.25±0.14 


0.24±0.14 


0.23±0.14 


1.29 





.995- 


1 


.000 


Obs. 





1 


2 





1 


4 



CF channel results: 

M M+M - bin (GeV/c^) 





bin 




5.310-5.334 


5.334-5.358 


5.358-5.382 


5.382-5.406 


5.406-5.430 


Sum 






Exp. 


10.18±0.72 


10.05±0.71 


9.93±0.7 


9.80±0.69 


9.68±0.68 


49.64 


0.700- 


-0.760 


Obs. 


10 


16 


12 


11 


10 


59 






Exp. 


11.22±0.76 


11.08±0.75 


10.94±0.74 


10.8±0.73 


10.67±0.72 


54.71 


0.760- 


0.850 


Obs. 


8 


13 


9 


13 


4 


47 






Exp. 


6.11±0.54 


6.03±0.53 


5.96±0.52 


5.88±0.52 


5.81±0.51 


29.79 


0.850 


-0.90 


Obs. 


1 


5 


9 


3 


6 


24 






Exp. 


4.65±0.46 


4.60±0.46 


4.54±0.45 


4.48±0.44 


4.42±0.44 


22.69 


0.900- 


-0.940 


Obs. 


6 


2 


8 


5 


4 


25 






Exp. 


3.95±0.42 


3.90±0.42 


3.85±0.41 


3.80±0.41 


3.75±0.40 


19.25 


0.940- 


-0.970 


Obs. 


1 


6 


3 


4 


5 


19 






Exp. 


2.75±0.44 


2.71±0.44 


2.68±0.43 


2.64±0.43 


2.61±0.42 


13.38 


0.970- 


-0.987 


Obs. 


1 


6 


3 


1 


3 


14 






Exp. 


0.83±0.25 


0.82±0.24 


0.81±0.24 


0.80±0.24 


0.79±0.23 


4.06 


0.987- 


-0.995 


Obs. 


1 


1 


1 


1 





4 






Exp. 


0.73±0.35 


0.71±0.34 


0.69±0.34 


0.68±0.34 


0.67±0.33 


3.48 


0.995- 


-1.000 


Obs. 


1 


1 





1 


1 


4 



strongly constrain new physics models that predict sig- 
nificant deviations from the standard model I14L 
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FIG. 24: Distribution of 1-CLs as a function of B(B° -> 
fi + /j,~). The expected (observed) limit at 95% C.L. is de- 
termined by the intersection of the black (blue) points with 
the line at 1-CLs = 0.05. The shaded regions indicate the 
spread of limits obtained from simulated experiments due to 
fluctuations in the background alone. 
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FIG. 26: Distribution of 1-CLs as a function of B(B° -> 
The expected (observed) limit at 95% C.L. is de- 
termined by the intersection of the black (blue) points with 
the line at 1-CLs = 0.05. The shaded regions indicate the 
spread of limits obtained from simulated experiments due to 
fluctuations in the background alone. 
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FIG. 25: Ax 2 distribution as a function of B{B ( l — > ). 
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The SM prediction @], at 68% C.L., is shown as a thin grey vertical band. 
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